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ABSTRACT 


This  is  the  fourth  interim  report  of  a  study  whose  purpose  is  to 
develop  and  apply  a  gen'^ralized  theory  of  the  effects  of  coupling  the 
modes  of  propagation  such  as  occur  in  distributed  microwave  devices. 

In  Sec.  I,  we  review  the  entire  work  done  so  far  in  this  study. 

This  review  is  in  descriptive,  rather  than  mathematical,  terms  to 
indicate  the  purposes  and  interrelations  of  the  various  parts  of  the 
program. 

In  Sec.  II,  the  analysis  of  the  class  of  non>uniform  systems  that 
we  have  been  studying  is  carried  further.  This  is  the  class  of  systems 
such  thc.t  the  derivative  of  the  system  matrix  is  expressible  ss  a 
commutator  of  the  system  matrix  with  another  operator.  This  class 
includes  many  of  the  interesting  non-uniform  systems,  and  it  is  a  class 
that  is  tractable  for  analysis. 

This  analysis  has  been  here  extended  in  several  directions.  In 
Sec.  II-B,  certain  simpler  subclasses  are  studied  in  detail.  The 
2  by  2  case  with  hermitian  system  matrix  is  discussed  first,  and  then 
the  general  2  by  2  case.  The  class  of  systems  such  that  the  system 
matrix  can  be  regarded  as  embedded  in  a  space  that  is  a  representation 
of  the  general  rotation  group  is  then  studied.  Two  formalisms  are 
used,  one  using  the  infinitesimal  transformations  of  the  rotation  group, 
and  the  other  the  corresponding  spin  matrices.  The  results  are  equiv¬ 
alent,  but  the  difference  of  form  may  be  useful. 

In  Sec.  II-C  we  consider  the  general  solution  of  the  equation 
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can  be  solved  for  ▼,  when  T  is  a  singular  operator,  and  then  some  of 
the  general  properties  of  the  commutator  operator.  These  results  are 
used  to  solve  the  system  equation  for  the  operator  A  under  very  general 
conditions.  Thus  we  have  solved  the  problem  of  actually  finding  the 
appropriate  commutator  operator  for  a  non>uniform  system  of  this  type. 

In  Sec.  II-D,  the  solution  of  such  systems  is  considered.  We  show 
that  it  is  possible  to  put  such  systems  into  a  canonical  form  in  which 
the  simplicities  of  the  system  are  exploited.  The  proceedure  is  in 
general  to  find  a  sequence  of  rotating  axes  in  terms  of  which  the  system 
non-uniformity  is  progressively  eliminated.  The  equations  that  determine 
these  axes  are  given  explicitly. 

This  work  is  in  general,  abstract  and  formal,  but  has  been  necessary 
to  provide  a  foundation  for  the  study  of  the  practical  possibilities  of 
mode  coupling  by  appropriately  designed  non-uniformity. 
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I  SUMARY  OF  WORK 


In  this  section  we  shell  review  the  work  of  the  contract  to  date, 
attempting  to  put  the  various  pieces  of  reported  results  into  proper 
relation  with  each  other. 

The  purpose  of  the  contract,  as  stated  in  the  contract,  is  to 
“advance  our  engineering  design  knowledge  and  theoretical  understanding 
of  electron  beam  microwave  devices  through  the  derivation  and  use  of 
the  theory  of  coupling  between  the  modes  of  propagation  existing  in  a 
beam-type  device."  As  stated  in  the  original  proposal,  the  broad 
objective  was  "to  provide  the  analytic  structure  whereby  existing 
design  concepts  can  be  interrelated  and  evaluated  and  new  concepts 
generated."  We  also  stated  that  our  general  approach  would  be  an 
attempt  to  "exploit  some  of  the  more  advanced  techniques  of  matrix 
algebra  and  abstract  vector  spaces." 


In  carrying  out  this  objective,  our  work  has  been  directed  towards 
two  broad  classes  of  structures— the  sectionally  uniform  and  the  non- 
uniform  types. 

By  a  "uniform"  system  or  section  we  mean  one  whose  design  parameters 
do  not  change  along  the  length.  The  electron  beam  velocity  does  not 
change  along  the  distance.  The  impedance  and  propagation  constant  of 
the  transmission  line  are  constant.  The  magnetic  field,  if  one  is 
present,  is  the  same  at  all  points  being  considered.  And  so  on.  A 
non-uniform  system  or  section  is  one  in  which  some  one  or  more  of  these 
parameters  varies  along  the  axis. 


Most  existing  devices  either  are  "  sec tiona 1 ly  uniform"  or  can  be 
treated  as  if  they  were.  A  travelling  wave  tube  may  be  considered  as 
being  built  of  sections  which  are  uniform.  Even  if  the  tube  is  focused 
by  a  periodic  magnetic  field,  the  periodicity  is  carefully  chosen  so 
that  the  results  will  approximate  the  behavior  the  tube  would  have  if 
a  single  solenoidal  field  was  used. 


One 
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exception  is  the  parametric  amplifier  with  travelling  pump 
The  pump  signal  in  effect  imposes  a  time  dependence,  at  the 
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puap  frequency,  of  certain  of  the  paraaetera  of  the  ayatea.  Further, 
if  the  puap  wave  ia  travelling,  there  ia  a  variation  of  phaae  along 
the  interaction  apace.  Thia,  then  ia  a  "non-unifora"  ayatea. 

There  ia,  furthermore,  atrong  indication  that  a  properly  choaen 
non-uniformity  can  lead  to  new  and  potentially  intereating  effecta. 

There  ia,  for  example,  experimental  evidence  auggeating  that 
variation,  with  z,  of  the  magnetic  or  electric  field  under  croaaed 
field  conditions  may  provide  strong  amplification.  If  so,  this  might 
permit  strong  amplification  with  a  very  simple  physical  structure. 

Such  an  effect  might  be  useful  for  several  purposes.  It  is  worth 
considering,  for  example,  if  there  is  need  of  compact  amplifiers  for 
use  at  low  frequencies.  It  would  be  a  good  possibility  for  millimeter 
or  submillimeter  wave  amplification,  except  that  in  this  region  it 
implies  an  average  magnetic  field  strength  that  is  probably  impractical. 
However,  the  thought  does  illustrate  the  potentialities  that  exist  of 
using  a  controlled  non-uniformity  to  achieve  a  desired  mode  coupling. 

The  analysis  of  the  effects  of  non-uniformity  is  still  in  a  very 
unsatisfactory  state.  Certain  situations  of  particular  interest— 
e.g.,  periodic  focusing  and  parametric  interaction— have  been  analyzed 
by  special  methods.  But  no  general  theory  exists.  There  is  no  theory 
that  will  tell  us  what  is  possible  in  such  systems,  or  that  will  give 
us  any  indications  at  all  of  how  the  possible  can  be  achieved. 

Thus,  there  appears  to  be,  here,  a  large  area  of  possible  design 
that  has  not  been  exploited  to  any  significant  degree. 

It  is  then  in  these  two  areas  that  we  have  concentrated  our 
attention. 

A.  SECTIONALLY  UNIFORM  DEVICES 

The  technique  for  the  analysis  of  the  mode  couplings  that  can 
occur  in  structures  that  are  sectionally  uniform,  or  that  can  be  so 
approximated,  was  in  fairly  good  shape  at  the  start  of  the  program. 
Methods  of  analysis  that  are  always  applicable,  at  least  in  principle, 
were  available.  The  principal  phenomena  that  are  possible  in  such 
systems  were  known.  And  a  good  deal  was  known,  for  the  simpler  cases 
at  least,  about  the  conditions  under  which  a  given  phenomenon  such  as 
amplification  or  power  transfer  will  occur. 
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Although  the  principal  phenomena  that  are  poaeible  and  the  condi- 
tiona  for  their  occurrence  were  known,  the  material  waa  widely  acattered 
in  the  literature  and  in  our  work  under  a  previoua  contract  [AF33(616)- 
S8O3] .  Nowhere,  it  aeemed,  had  it  been  brought  together  and  codified 
into  a  aingle  coherent  theory. 

One  of  the  apecific  objectivea  of  thia  contract  aa  indicated  in  the 
contract  itaelf,  waa  to  correct  thia  deficiency.  Therefore,  during  the 
contract  we  prepared  and  iaaued  Technical  Note  1,  "Topological  Analyaia 
of  Sectionally  Uniform  and  Parametric  Syatema."  Thia  covera  in  a  quali¬ 
tative  and  aemi -quantitative  way  the  phenomena  that  are  poaaible  aa  a 
reault  of  the  uniform  coupling  of  modea  by  paira.  In  ao  doing,  we  have 
provided  a  method  for  diagraming  moat  exiating  devicea  in  a  way  that 
exhibita  the  principal  local  effecta  and  allowa  one  to  deduce  the  nature 
of  the  principal  over-all  conaequencea. 

The  report  waa  difficult  to  write  becauae  of  the  danger  of  awamping 
the  reader  in  a  maaa  of  definitiona  and  mathematical  detaila.  The 
preciae  definition  of  terma  and  the  eaaential  mathematical  atructure 
could  not  be  avoided  without  making  the  report  an  eaaentially  auper- 
ficial  one.  But  neither  could  they  be  allowed  to  dominate  the  report 
without  making  the  report  uaeful  only  to  the  expert  in  the  field— who 
preaumably  knowa  the  material  anyway.  The  report  waa,  therefore, 
written  with  great  care  to  preaent  the  eaaential  featurea  of  the  theory 
firat,  filling  in  the  detaila  later  and  in  the  Appendixea.  It  ia  hoped 
that,  by  thia  meana,  it  haa  been  put  into  a  form  that  will  be  uaeful. 

The  Technical  Note  waa  primarily  concerned  with  the  pairwiae 
coupling  of  modea.  The  aimultaneoua  coupling  of  a  larger  group  of 
modea  waa  avoided,  except  for  the  inclusion  of  what  we  call  "chain 
coupling,"  in  Appendix  C.  Thia  ia  a  more  general  type  of  coupling 
that  ia,  mathematically,  cloaely  related  to  pairwiae  coupling.  (Pair¬ 
wiae  coupling  can  be  regarded  aa  the  chain  coupling  of  a  chain  of 
length  two.)  It  waa  included  in  the  Technical  Note  becauae  ita  math¬ 
ematical  development  includea,  without  being  much  more  complicated 
than,  the  neceaaary  mathematical  development  for  aimple  pairwiae 
coupling.  Alao,  it  includea  aituationa  that  may  exiat  aa  diarupting 
phenomena  in  preaent  devicea. 
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The  theory  of  chain  coupling  waa  developed  and  reported  originally 
in  Interim  Report  2,  Seca.  II>B  and  II>C.  The  veraion  of  thia  theory 
in  Appendix  C  of  the  Technical  Note  is  a  rewrite  of  these,  and  containa 
easentially  the  aame  material. 

The  Technical  Note  also  included  the  theory  and  effect  of  pairwise 
parametric  coupling.  This  is  a  non-uniform  situation,  but  one  which 
can  be  handled  in  a  manner  that  is  closely  related  to  the  procedures 
for  uniform  systems.  Also,  of  course,  parametric  coupling  is  used 
in  a  very  important  class  of  existing  devices.  Since  it  was  the  in¬ 
tention  in  the  Technical  Note  to  bring  together,  insofar  as  possible, 
all  the  coupled  mode  theory  necessary  to  the  qualitative  understanding 
of  all  existing  linear  electron  beam  devices,  it  was  appropriate  that 
parametric  theory  should  be  included. 

A  different  approach  to  the  study  of  uniform  systems  was  developed 
and  reported  in  Interim  Report  1,  Sec.  II-A.  This  used  what  we  called 
the  "System  Function  Characteristic."  This  procedure  is  somewhat 
analogous  to  the  transfer  function  used  in  network  theory  for  calculation 
transient  responses.  It  appears  to  be  particularly  useful  in  the 
analysis  of  parametric  situations  in  which  an  indefinitely  large  number 
of  frequencies  may  be  involved. 

Application  of  this  method  was  made  to  a  device  in  which  two 
pump  signals  are  imposed  simultaneously.  Under  suitable  conditions, 
this  will  couple  the  original  signal  to  successively  higher  parametric 
harmonics  as  the  process  continues. 

We  have  suggested  that  this  might  be  a  useful  device  for  the 
generation  of  high  frequencies  by  harmonic  interaction.  A  final 
evaluation  of  the  possibilities  here  has  not  yet  been  made,  but  the 
prospects  do  look  interesting. 

Finally,  we  should  note  one  other  area  of  interest  that  has  not 
yet  been  tackled  at  all.  This  is  the  case  where  statistical  methods 
are  required.  This  may  happen  because  of  the  very  large  number  of 
modes  that  may  be  involved,  such  as  in  the '*  laser . Or  it  may 
happen  because  of  the  statistical  nature  of  the  processes  that  are 
involved.  There  is,  for  example,  the  possibility  of  using  noise  to 
pump  a  parametric  amplifier. 


4 


There  is  need  for  techniques  to  hsndle  these  problems.  No  effort 
hss  ss  yet  been  nsde  in  this  direction,  although  the  group  theoretic 
methods  to  be  discussed  shortly  may  provide  s  useful  starting  point. 
However,  specific  attention  should  be  given  to  this  class  of  problems 
when  time  permits. 

B.  NON-UNIFORM  SYSTEMS 

The  other  major  area  that  has  concerned  us  is  the  analysis  of  non- 
uniform  systems.  No  general  understanding  exists  of  the  possibilities 
of  such  systems.  Only  certain  rather  narrowly  restricted  types  of  non- 
uniform  systems,  such  as  those  using  conventional  parametric  interaction, 
have  been  analyzed.  And  yet  there  is  evidence  that  non-uniformity  can 
provide  the  means  for  obtaining  some  badly  needed  design  capabilities 
such  as  strong  amplification  without  excessively  intricate  structure. 

We  have  made  substantial  progress  on  this  problem  although  it  must 
be  admitted  that  the  results  are  so  far  quite  fragmented  and  have  not 
yet  been  coordinated  into  a  single,  useful  body  of  theory. 

The  earliest  approach  was  by  an  adaptation  of  the  methods  of 
classical  physics.  This  was  reported  in  Interim  Report  1,  Sec.  II-B. 

“A  Formal  Lagrangian  Function  and  Related  Concepts."  It  was  shown 
there  that  it  was  possible  to  define  a  Lagrangian  Function  and  a 
Hamiltonian,  and  to  obtain  from  them  such  related  concepts  as  the 
Poisson  Bracket  and  the  canonical  equations  of  motion.  It  was  also 
found  possible  to  define  appropriate  contact  or  canonical  transformations, 
and  to  classify  systems  according  to  their  infinitesmal  contact  trans¬ 
formations.  Although  these  definitions  are  formal— in  the  sense  that 
the  Hamiltonian,  for  example,  has  little  if  any  relation  to  the  total 
energy  of  the  aystem  —  this  approach  appears  to  open  the  way  to  the 
employment  of  the  various  methods  that  have  proven  so  powerful  in 
classical  and  quantum  mechanics. 

We  have  not  yet  carried  this  approach  further,  mainly  because  of 
the  pressure  of  other  ideas.  It  is,  however,  a  technique  that  we  con¬ 
tinue  to  regard  hopefully,  and  that  we  intend  to  return  to  at  some 
future  time. 

Another  approach,  which  was  partly  generated  out  of  the  preceding 
one,  is  the  use  of  group  theoretic  methods.  This  was  reported  in 
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lateria  Report  3,  Sec.  I.  It  eroee  out  of  the  forael  aethodeeince  the 
infiniteeael  contect  treaeforaetione  of  e  eyetea  do  fora  e  group  that 
ia  ea  appropriate  aeaee  deacribes  the  eyetea. 

The  group  theoretic  deecriptioa  of  e  eyetea  ie  priaarily  ueeful 
for  the  purpoeee  of  claeeificetion  aad  repreaeatatioa.  At  leeat  thia 
haa  beea  the  priacipal  uae  of  thia  approach  ao  far.  It  haa  proven  to 
be  a  powerful  method  of  obtaining  a  convenient  repreaentation  for 
certain  typea  of  non-unifora  ayateaa.  Aa  auch,  it  providea  the  meana 
for  the  further  analyaia  of  auch  ayatema. 

It  can  alao  be  added  that  theae  methoda  are  likely  to  be  highly 
important  when  we  come  to  develop  a  atatiatical  theory  of  ayatema  with 
large  numbera  of  modea,  auch  aa“laaera.'*  In  auch  ayatema,  the  princi* 
pal  information  we  are  likely  to  have  regarding  exiating  couplinga  will 
derive  from  aymmetry  conaiderationa— that  ia,  from  group  theoretic 
propertiea.  It  aeema  likely,  therefore,  that  the  problem  can  be 
made  moat  tractable  by  treating  it  in  at  leaat  a  group  theoretic  type 
of  repreaentation. 

The  third  approach,  which  aeema  to  be  in  the  proceaa  of  yielding 
important  reaulta,  concerna  the  analyaie  of  a  apecial  claaa  of  non- 
uniform  ayatema.  Thia  claaa  includea  parametric  ayatema,  the  exponen¬ 
tially  tapered  tranamiasion  line,  and  moat  of  the  ayatema  which  have 
been  individually  aolved  to  date.  It  alao  includea  much  more  compli¬ 
cated  ayatema.  It  ia  a  claaa,  however,  that  haa  aome  very  important 
propertiea  that  make  ita  analyaia  tractable. 

Thia  claaa  of  ayatema  ia  important,  not  only  for  the  apecific 
devicea  that  it  includea,  but  alao  aa  an  analyaable  claaa  of  conaider- 
able  variety.  Thua,  we  can  uae  it  to  explore  what  ia  poaaible  in 
non-uniform  ayatema  and  to  determine  conditions  under  which  a  given 
type  of  behavior  can  be  obtained. 

The  claaa  referred  to  ia  deacribed  mathematically  aa  that  in 
which  the  derivative  of  the  ayatem  operator  ia  expreaaible  aa  a 
commutator  of  the  operator  with  another  operator  (c/.  Interim  Report  2, 
Sec.  II-A).  It  may  alao  be  aaid  to  be  the  claaa  in  which  the  behavior 
of  the  ayatem  ia  eaaentially  rotational,  although  thia  atatement  would 
require  conaiderable  amplification  in  terma  of  the  geometry  of  the 
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vector  apace  involved,  and  the  neaning  of rotation'*  in  thia  connection. 
Thia  viewpoint  ia  at  leaat  partly  developed  and  exploited  in  Sec.  II-B 
below. 

The  firat  reaulta  and  the  beginning  of  recognition  of  the  full 
claaa  were  reported  in  Interim  Report  2,  Sec.  II-A.  Thia  waa  carried 
on,  and  the  full  claaa  defined,  in  Interim  Report  3,  Sec.  II.  Section  II 
of  the  preaent  report  continuea  varioua  aapecta  of  thia  analyaia. 

Thia  analyaia  ia  atill  aomewhat  fragmented  in  that  we  are  not  yet 
ready  to  pull  together  the  piecea  into  a  coherent  body  of  theory.  It 
ia  alao  atill  quite  incomplete  in  that  the  work  ao  far  haa  been  dealing 
with  varioua  aapecta  of  the  underlying  theory,  with  only  minor  attention 
being  given  to  practical  reaulta.  Thia  haa  been  neceaaary  becauae  of 
the  apparently  almoat  complete  lack  of  previoua  work  on  thia  claaa  of 
problema.  Hopefully,  however,  we  are  now  about  ready  to  study  directly 
the  practical  problema  that  are  the  reason  for  our  interest  in  this 
field. 

It  ia  dangerous  to  predict  progress  in  an  area  about  which  so 
little  is  known  as  thia  one.  However,  it  does  appear  that  there  is 
reason  to  be  hopeful  that  we  are  now  in  a  position  to  make  substantial 
progress  in  the  understanding  of  how  non-uniformity  can  be  used  for 
practical  purposes. 


7 


II  ANALYSIS  OF  NON-UNIFOIM  SYSTEMS 


A.  INTRODUCTION  AND  SUMMARY 

We  ahall,  in  the  varioua  aectiona  that  follow,  continue  our  analyais 
of  non-uniforni  ayatema  of  the  reatricted  class  we  have  considered  in 
Sec.  II-A  of  Interim  Report  2  and  Sec.  II  of  Interim  Report  3.  A 
non-uniform  system  is  one  which  is  described  by  a  vector  differential 
equation  of  the  form: 

■  “/R(z)*(z)  (1) 

dz 

where  z  is  the*'state  vector”  represented  as  a  column  matrix  describing 
the  state  of  the  system  at  the  position  z,  and  R  is  the  system  matrix. 

The  system  is  non-uniform  if  R  is  dependent  on  z.^ 

The  particular  class  of  systems  that  we  will  consider  are  those  for 
which  R(z),  although  not  constant,  is  such  that  its  derivative  is  ex¬ 
pressible  as  the  commutator  of  R  with  some  matrix  A: 

^  -  ;[R,  A]  -  ;(RA  -  AR)  .  (2) 

dz 

Our  interest  in  this  class  arises  from  the  fact,  discussed  in  Sec.  II 
of  Interim  Report  3,  that  certain  subclasses  of  this  class  are  both 
practically  important  and  explicitly  soluble.  Thus,  this  class  may  be  a 
useful  one  for  exploring  what  can  be  done  with  non-uniformity  in  phys¬ 
ical  systems.  Since  use  is  made  of  non-uniformity  in  some  important 
systems  such  as  distributed  parametric  amplifiers,  it  is  not  unreasonable 
to  hope  that  the  study  of  such  systems  may  open  the  way  to  important 
new  devices. 

A*  ••■■1,  w*  •hail  iaSicat*  eolan  aactora  by  bold  facad  leaar  eaaa  lattara,  aad  aatricaa  by  bold  facad 
capital  lattara.  aball  iadicata  tba  coaplaa  coajacata  by  (*),  aad  tba  baraitiaa  coajatata,  or 
eoaplaa  eaajafata  traaapoaa,  by  (f). 
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The  metrix,  A,  ia  not  well  defined  by  Eq.  (2).  We  can  add  to  A 
any  matrix  that  commutea  with  R,  without  affecting  R  in  Eq.  (2).  In 
particular,  we  can  add  to  A  any  function  of  R  and  x,  /(R,  z),  which  ia 
well  defined  [apecifically,  /(u,  z)  ia  a  auitable  generating  function  if 
for  any  value  of  z,  z),  conaidered  aa  a  function  of  u,  ia  analytic 

in  a  aimply  connected  region  that  includea  the  eigenvaluea  of  R  at  that 
value  of  z] . 

We  ahowed,  in  Sec.  II-A  of  Interim  Report  2,  that  a  neceaaary  and 
aufficient  condition  for  the  exiatence  of  auch  an  A  ia  that  the  eigen¬ 
valuea  and  atructure  of  R  be  conatant.  This  doea  not,  however,  indicate 
whether  or  not  A  can  be  chosen  to  be  conatant.  If,  however,  an  A  can 
be  found  which  ia  conatant,  then  Eq.  (1)  is  explicitly  soluble. 

In  Sec.  II  of  Interim  Report  3,  we  showed  that  this  process  is 
generalizable.  In  particular,  if  no  constant  A  can  be  found  for  a 
given  R,  we  may  still  be  able  to  find  an  A  that  has  constant  eigenvalues 
and  structure  so  that  its  derivative  is  expressible  as  the  commutator 
with  a  matrix  D: 


—  -  j(A,  R]  (3) 

dz 

If  a  constant  R  can  be  found,  then  Eq.  (1)  is  still  explicitly 
soluble.  If  not,  but  if  D  can  be  chosen  so  that  is  obeys  the  same  type 
of  equation, 

^  -  j[B.  C]  (4) 

dz 

and  if  C  can  be  choaen  to  be  constant,  then  again  an  explicit  solution 
to  Eq.  (1)  is  obtainable,  although  one  that  involves  an  infinite  series. 

Thus  this  class  of  systems  does  include  some  soluble  subclasses 
of  considerable  variety,  and  the  whole  class  appears  to  have  some  in¬ 
teresting  and  useful  simplicities,  compared  to  the  whole  class  of 
non-uniform  systems. 

The  immediate  need  in  the  analysis  of  this  class  is  for  means  of 
determining  an  A  from  the  properties  of  R  and  its  derivatives  to  provide 
a  tool  for  the  determination  of  the  simplest  A— constant,  if  possible, 
otherwise  expandable  according  to  Eq.  (3). 

If 


Ad  expression  for  sn  A  wss  given  in  Sec.  II>A  of  Interim  Report  2 
in  terms  of  the  mstrix  8(2)  which  reduces  R  to  csnonicsl  form— i.e.,  the 
mstrix  of  eigenvectors  of  R.  Another  expression  wss  given  in  Sec.  II  of 
Interim  Report  3,  expressing  A  in  terms  of  the  dysds  formed  from  the 
eigenvectors  snd  generslixed  eigenvectors  of  R  and  their  derivatives. 

Neither  of  these  expressions  is  entirely  satisfactory  for  deeper 
analysia.  Both  depend  explicitly  on  the  eigenvectors  (and,  if  necessary, 
generalized  eigenvectors).  The  eigenvectors,  however,  may  be  multiplied 
by  arbitrary  acalar  functions  of  z  without  disturbing  the  eigenvector 
properties.  It  is  this  fact  that  leads  to  the  arbitrariness  of  A.  How¬ 
ever,  there  does  not  appear  to  be  any  simple  relation  between  a 
z-dependent  renormalization  of  the  eigenvectors  and  the  resultant  change 
of  A.  It  is  not  clear,  therefore,  how  to  specify  the  normalization  of 
the  eigenvectors  so  as  to  obtain  an  A  matrix  showing  any  desired  prop¬ 
erties. 

In  an  effort  to  overcome  this  difficulty,  we  have  tackled  the  prob¬ 
lem  again  of  determining  A  in  Eq.  (2)  given  R.  This  time  we  have 
sought  a  specification  of  A,  or  of  a  possible  A,  in  terms  only  of  R 
and  ita  derivative. 

In  Sec.  II-B  of  the  present  report,  this  problem  is  solved  for  variou 
restricted  classes  of  R,  of  increasing  complexity.  We  are  able,  for  these 
classes,  not  only  to  obtain  explicitly  a  possible  A,  but  have  gone 
further  and  have  been  able  to  find  A's  that  are  themselves  expandable 
in  the  form  of  Eq.  (3).  Thus,  for  these  classes,  we  have  an  explicit 
procedure  for  developing  a  chain  of  equations  of  the  form  of  Eqs.  (2), 

(3),  (4),  etc. 

This  solution  has  an  importance  beyond  the  purely  theoretic.  It 
includes  the  case  when  the  non-uniformity  introduces  a  pairwise  coupling 
of  the  modes  of  the  uniform  system.  It  includes,  then,  phenomena  of 
considerable  practical  importance.  In  preparation  for  the  detailed  in¬ 
vestigation  of  these  possibilities,  we  have,  therefore,  developed  the 
explicit  solutions  in  different  but  equivalent  representations  that  appear 
to  be  of  possible  interest. 

In  Sec.  II-C  of  the  present  report  we  consider  the  more  general 
class  of  systems,  restricted  only  by  the  condition  that  R  is  of  simple 
structure.  In  this  case,  we  are  able  to  obtain  an  explicit  solution 
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for  A.  This  does  not  complete  the  work  needed,  since  the  A  so  found 
is  not,  in  genersl,  expsndsble  in  the  form  of  Eq.  (3).  We  hsve  not  ■■ 
yet  been  sble  to  find  s  genersl  expression  for  an  expsndsble  A.  Indeed, 
we  hsve  not  yet  proven  thst  an  expsndsble  A  slwsys  exists,  slthough  we 
hsve  gone  fsr  enough  to  feel  fsirly  confident  of  the  truth  of  this 
conjecture.  Nevertheless,  the  development  of  sn  explicit  expression 
for  sn  A,  given  R  snd  R'  end  involving  only  the  eigenvslues  snd  not 
the  eigenvectors  of  R  does  sppesr  to  be  s  substsntisl  step  forwsrd. 

In  the  course  of  developing  this  expression  for  an  A,  we  hsve  been 
led  to  investigste  the  " inversion"  of  a  general  singular  operator. 
Quotation  marks  are  used  here,  because  a  singular  operator,  by  defini¬ 
tion,  has  no  inverse.  Within  the  restricted  context  in  which  the 
problem  arises,  however,  there  does  exist  an  operator  that  has  the 
properties  of  an  inverse.  It  is  in  this  sense  that  we  have  investigated 
the  problem  using  the  techniques  of  abstract  finitely  dimensioned  linear 
algebras.  The  work  is  here  given  in  Appendix  A. 

This  work  also  leads  to  a  more  general  solution  of  Eq.  (2)  for  A. 
Specifically,  it  permits  us  to  remove  the  restriction,  in  Sec.  II-C, 
that  R  be  of  simple  structure.  Since  this  restriction  is  of  little 
practical  importance,  the  development  of  this  generalization  is  re¬ 
legated  to  Appendix  R.  However,  it  may  be  noted  that  the  resultant 
expression  for  A,  even  when  R  is  of  simple  structure  so  that  Sec.  II-C 
applies,  is  not  the  same.  (A,  it  will  be  recalled,  has  some  arbitrari¬ 
ness  to  it.)  It  is  possible  that  this  more  general  solution  may  be 
of  direct  interest  even  where  its  generality  is  not  important.  This  is 
speculation  at  this  point,  but  the  possibility  should  be  kept  in  mind. 

In  Sec.  II-D,  we  have  undertaken  the  formalization  and  systemi zati on 
of  the  process  of  solution  of  a  chain  of  equations  of  the  form  of 
Eqs.  (2),  (3),  (4),  etc.  The  method  of  solution  is  that  used  in  Interim 
Report  3,  Sec.  II-D.  We  find  that  we  are  able  to  interpret  the  process 
as  a  succession  of  z-dependent  changes  of  basis.  The  transformation 
operators  tliat  accomplish  this  are  chosen  to  obey  a  set  of  equations 
that  we  will  call  the  canonical  transformation  equations  of  the  chain. 

B.  NON-UNIFORM  COUPLING  BY  PAIRS  OF  MODES 

We  have,  in  Interim  Reports  2  and  3,  considered  non-uniform  systems 
such  that 
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dx(x) 

dz 


->R( »)*(*) 


(5) 


■7—  ■  ~  j(AR  -  R4)  •  -j[A,  R]  (6) 

dz 

^  -  -  ;[B,  A]  (7) 

dz 


—  •  -  j[C,  B]  (8) 

dz 

etc. , 

where  the  sequence  terminates  when  and  if  a  constant  matrix  is  reached. 

We  have  been  interested  in  such  systems  primarily  because  they  are  much 
simpler  than  the  general  non-uniform  systems,  and  yet  evidently  retain 
much  of  the  potentailities  of  non-uniformity. 

If  A  can  be  chosen  to  be  constant,  then  the  system  is  a  general¬ 
ization  of  the  exponentially  tapered  transmission  line,  and  is  directly 
soluble.  If  A  cannot  be  taken  as  constant,  but  B  can  be,  then  the  system 
is  still  explicitly  soluble  in  closed  form.  If  C  is  constant  and  the 
others  not,  then  it  is  still  explicitly  soluble,  although  the  solution 
involves  an  infinite  sum.  Thus,  while  the  complexity  increases  rapidly, 
as  we  increase  the  length  of  thesequence  of  commutator  relations,  Eqs.  (5), 
(6),  (7),  (8),  etc.,  the  problem  still  retains  a  degree  of  tractabi li ty. 

The  practical  importance  of  these  systems  lies  in  the  fact  that  they 
include  systems  in  which  mode  coupling  is  induced  by  an  appropriate 
non-uniformity  of  the  system  parameters. 

We  shall,  in  this  section,  analyze  in  detail  systems  of  this  type 
in  which  only  a  pair  of  modes  is  involved.  This  is  probably  the  most 
interesting  case  from  the  practical  point  of  view.  The  generalization 
of  the  analysis  to  more  complex  situations  will  be  deferred  until  later. 

We  shall  first  discuss  the  situation  when  R  is  a  2  2  hermitian 

matrix.  This  is  the  simplest  situation.  Historically,  its  analysis 


13 


provided  the  concepts  thet  opened  the  way  to  the  more  general  case.  It 
ia  hoped  that  ita  presentation  here  nay  provide  similar  insight  to  others. 

We  shall  follow  this,  then,  with  an  analysis,  first,  of  the  situa* 
tion  with  a  general  2^2  matrix,  and  then  of  the  situation  when  B  is 
isomorphic  to  the  rotation  group.  In  the  final  case,  we  achieve  a  con* 
siderable  degree  of  generality,  although  still  not  the  moat  general 
case  we  would  like  to  be  able  to  handle. 


1.  K  -  I,  R  Hernitian,  n  -  2 

We  consider  the  case  when  R(2)  is  restricted  to  be  a  2  2  matrix 

that  is  hermitian  at  all  z.  In  our  usual  terminology,  R  is  Af-hermi tian , 
with  K  I,  the  identity  matrix. 

We  have  proven,  that,  for  Eq.  (6)  to  hold  at  all,  it  is  necessary 
and  sufficient  that  the  eigenvalues  of  R  be  constant,  with  the 
z-dependence  of  R  involving  only  a  change  of  direction  of  its  eigen¬ 
vectors. 


We  have  also  shown  that  if  R  is  /f*hermi tian ,  we  can  so  choose 
A  that  it  is  everywhere  Af-hermitian.  The  matrix  A  is  not  completely 
determined  by  Eq.  (6).  To  a  given  A,  we  can  add  any  matrix  function 
of  z  that  commutes  with  R,  without  changing  Eq.  (6).  It  is  for  this 
reason  that  we  can  only  say  that  A  can  be  chosen  to  be  everywhere 
E'hermitian,  not  that  it  must  be. 

If  an  A  exists  —  i.e.,  R  has  constant  eigenvalues  —  then  the  first 
question  we  ask  is,  does  a  constant  A  exist?  If  so,  and  if  we  can 
find  it,  the  problem  is  solved.  If  a  constant  A  does  not  exist,  tlien 
our  next  question  is,  does  an  A  with  constant  eigenvalues  exist?  If 
so,  we  can  write  Kq.  (7)  and  begin  all  over  to  examine  the  properties 
of  B.  If  it  is  impossible  to  find  an  A  that  is  either  constant,  or 
has  constant  eigenvalues,  then  the  whole  procedure  fails  and  we  must 
seek  a  solution  by  other  means. 


The  central  problem  of  this  mode  of  analysis  is  the  question  of 
whether  or  not  it  is  always  possible  to  so  choose  A  that  its  eigen¬ 
values  are  constant,  so  that  we  can  then  find  a  R  that  will  fit  Eq.  (7). 
It  is  with  this  problem  that  we  are  mainly  concerned  here. 


If  we  write  R  as 
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R 


(9) 


then  the  requirement  that  R  be  hermitian  apecifiea  that  0  and  4>  be  real 
functions  of  2,  and  that  ^  be  the  complex  conjugate  of  \p. 


The  characteristic  equation  of  Eq.  (9)  is 


A.2  -  (6  +  <^)\  ■¥  (6<p  -  ■  0 


(10) 


If  the  roots  of  Eq.  (10),  the  eigenvalues  of  R,  are  constant,  then 
Eq.  (10)  must  be  constant,  or  the  trace  and  the  determinant  of  R  must 


can  be 

wri 

tten  as 

a  +  / 

g 

g  ~  jh 

m  "* 

J 

and  /, 

g* 

and  h  are 

lition  that 

h* 

«  a* 

.  1)  with 
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constraii 

;ian  R  w 

i  th 

constant 

(11) 


(12) 


a  being  constant. 

The  matrix  R  of  Eq. 
most  general  form  of  liei 
therefore  is  expressible  as  Eq.  (6). 

We  can  look  for  an  A  of  the  same  form  as  Eq.  (11). 


(M  *  F  G  +  jH\ 

G  -  jH  lit  -  F  I 


(13) 
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where  the  constraint  that 


f*  +  G*  +  ff*  -  6*  (14) 

will  be  applied  later. 

If  we  substitute  Eqa.  (11)  and  (13)  in  Eq.  (6),  we  find  that  we 
require 


/*  -  2(hG  -  gH) 

g'  -  2{fH  -  hF) 

h‘  -  2(gF  -  fG)  (15) 

where  the  prime  means  (d/dz). 

We  may  note  that  n  and  M  do  not  enter  into  these  equations  or  into 
Eqs.  (12)  and  (14).  The  constant  «  is  part  of  R,  but  does  not  affect 
the  choice  of  A.  The  constant  M  can  be  chosen  arbitrarily  and  might 
as  well  be  taken  as  zero. 

If,  now,  we  define  the  S-vectors: 

^  ^ 
u  »  if  *  ig  *  kh 

V  -  ^iF  *  ]g  *  kH  (16) 

where  i,  j,  k  are  the  three  unit  vectors  of  a  cartesian  coordinate 
system,  then  Eq.  (15)  can  be  written  as 

u'  -  -  2  u  ^  V  .  (17) 

In  other  words,  the  right  side  of  Eq.  (15)  is  a  cross  product 
relation. 

Equations  (12)  and  (14)  can,  then,  be  written  as 

u  *  u  ■  o*  (18) 

V  ’  V  ■  6*  .  (19) 
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Equation  (18)  ia  obtainable  from  Eq.  (17),  and  hence  adda  nothing 

new. 

Equation  (17)  atatea  that  u'  ia  perpendicular  to  u  and  v.  The 
vector  V,  then,  muat  have  a  component  that  ia  perpendicular  to  u,  but 
may  have  a  parallel  component.  It  ia  perpendicular  to  u'.  It  ia 
reaaonable,  therefore,  to  conaider  a  aolution  for  v  of  the  form 


V  ■  fliu  u*  +  pu  (20) 

where  a  and  p  are  scalar  constants  or  functions  to  be  determined. 

If  we  substitute  Eq.  (20)  and  in  Eq.  (17)  we  see  that 

a'  ■  ■"  2fltu  X  (u  X  y' ) 

■  -  2a{(u  •  u')u  “  (u  *  u)u*} 


•  +  2«(u  •  u)u' 

*  +  . 

Hence  Kq.  (17)  is  satisfied  if  a  is  constant  and 

1 

a  «  - 

2o^ 

Substituting  Eq.  (20)  in  Kq.  (19),  we  find 


(21) 


(22) 


a*{u  X  u*)  •  (ii  X  u' )  +  2ap(u  x  u')  •  u  +  p*u  •  u 
«*{(«  •  u)(u*  •  u' )  ~  (u  •  u*)(u  •  u*)}  +  o*P* 


4a' 


(u'  •  u')  +  aV* 


(23) 


p*  -  (6Vo*)  -  (u'  •  u')/(4a*) 


(24) 
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and  p  ia  determined  aa  a  function  of  t.  The  conatant  6  muat,  of 
courae,  be  taken  aa  aufficiently  large  ao  that  p*  ia  nowhere  negative 
if  A  ia  to  be  hermitian. 

Hence,  it  ia  poaaible  to  determine  a  matrix  A  that  aatiafiea 
Eq.  (6)  and  that  haa  conatant  eigenvaluea  ao  that  it  ia  expandable  aa 
Eq.  (7). 

We  ahall  not  puraue  thia  further,  here.  Our  main  objectives  were 
to  find  a  method  that  will  allow  us  to  handle  the  more  general  problem, 
and  to  gain  insight  into  the  significance  of  Eq.  (6)  itself.  We  have, 
hopefully,  accomplished  these  objectives  in  discovering  that  we  are 
dealing  with  an  operator  that  can  be  considered  as  the  cross  product 
of  an  entity —  the  vector  u— time  the  operand.  This  suggests  that  the 
operator  j  [“,  R]  be  considered  as,  in  some  sense,  the  cross  product  of 
R  and  the  operand.  It  suggests,  then,  that  we  consider  whether  this 
thought  will  allow  us  to  tackle  the  more  general  case. 

2.  The  General  2  2  R  Matrix 

We  consider,  now,  the  case  where  R  is  a  2  ^  2  matrix  function  of 
z.  Then  the  Cayley>Hami Iton  Theorem  states  that  there  must  exist  an  n 
and  h  such  that 


R*  -  oR  +  6l  (25) 

Furthmore,  a  and  b  must  be  constants  if  the  eigenvalues  are  to  be 
constant. 

If,  now,  we  regard  the  operator  j[“,R]  as  analogous  to  the  operator 
(u  ^  ~),  then  Eq.  (6)  becomes  analogous  to  Eq.  (17).  The  solution 
to  Eq.  (17)  was  found  by  considering  a  form  expressible  as  Eq.  (20). 

The  analogous  form  would  then  be 

A  -  -ja[R'.  R]  +  pR  +  ql  (26) 

For  thia  to  satisfy  Eq.  (6),  we  must  have 

R'  -  -  a(R'R*  -  2nn'R  +  R*R' )  (27) 


II 


Now  using  Eq.  (25)  and  its  derivstive 


m'  +  R'li  >  oR' 


(28) 


we  find  that 


R'R*  -2RR'R  R*R' 


-  R'R*  +  R»R'  -  2R(aR'  -  HR') 


-  R'R*  +  3R*R'  -  2oRR' 


-  R'(aR  +  bl)  +  3(aR  +  bI)R'  -  2oHR' 

-  a(RR'  R'R)  +  4bR' 

“  (o*  +  46)R'  (29) 

This  assumes  that  (o*  +  46)  does  not  vanish.  Ttiere  are  cases  in 
which  it  does  vanish,  so  that  Eq.  (26)  fails.  These,  however,  are 
limiting  cases  and  are,  furthermore,  soluble  by  other  methods.  In 
particular,  it  appears  that,  in  these  cases,  R  always  commutes  with 
its  integral,  so  that 


*(z)  »  [exp  ~iJR(2)ds)*(0) 

is  the  solution  of  Eq.  (5) 

We  will  assume  that  a*  +  46  does  not  vanish.  If,  then,  we  set 

«  ■  l/(a*  +  46)  (30) 

we  see  that  Ivq.  (26)  satisfies  Eq.  (6).  We  note  that  a  is  necessarily 
constant . 

Sk 

Now,  from  Eq.  (26),  we  have  that 

A»  .  -«»[R',  R]*  -  >«p{[R',  R]R  +  R[R',  R]} 

-  2;«q[R'.  R]  +  (pR  +  qj)»  .  (31) 

If 


Considering  theee  terns  in  turn,  with  the  sid  of  Eqs.  (25)  end 
(28),  we  find  thst 

[R'R]<  -  R'HR'R  -  HR'>R  -  R'R*R'  +  RR'IH' 


-  R'R(aR'  -  HI')  -  (oR'  '  R'R)(aR'  -  RR') 

-  R'R*R*  +  (oR*  -  R*R)I«' 

-  -  4R'R*R*  +  4oR*IRI'  -  o»R'» 

-  -  4R'(aR  +  6I)R'  +  4aR'HR'  -a*R'» 

■  “  (o*  +  46)R** 

»  (l/a)R'*  (32) 

and 

{[R'.  R]R  +  RtR'.  R]}  -  R'R*  -  R*R' 

-  R'(aR  +  6l)  -  (oR  +  6l)R' 

>  a(R'R>RR') 

-  o[R',  R] 

=  ;<i(l/a)(A  -  pB  -  ql)  (33) 

and 

[R'.  R]  -  ;(l/rt)(A  -  pR  -  ql)  (34) 

and 

(pR  +  ql)*  ■  p*(aR  +  bl)  +  2pqR  +  q*I 


■  (op*  +  2pq)R  +  (6p*  +  q*)I  .  (35) 


Since  R'  is  also  2  2,  it  must  obey  its  characteristic  equation, 

so  that  there  must  exist  functions  /  and  g,  of  r  such  that 
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r» 


fi  +  «ii' 


(36) 


However,  if  we  prenultiply  Eq.  (28)  by  R'  and  subtract  it  from 
Eq.  (28)  post-multiplied  by  R*,  we  find  that 

■R'*  -  R'»R  (37) 

so  that  R'^  must  commute  with  R.  If  R'  commutes  with  R,  the  problem 
is  trivial.  It  is  simple  to  show,  then,  that  either  R  is  constant  or 
(a^  -f  4h)  is  zero.  If  R'  does  not  commute  with  R,  then,  substituting 
Eq.  (36)  in  Eq.  (37) 


gHR'  -  gR'R 


(38) 


so  that  g  must  vanish,  and 


R'*  •  ft  (39) 

Hence,  Eq.  (32)  becomes 

tR'.  R]*  -  (//a)I  (40) 

If  we  collect  these  results  together  by  putting  Eqs.  (40),  (33), 
(34),  and  (35)  into  Eq.  (31),  we  find  that  the  terms  in  R  cancel  and  we 
are  left  with 


A*  •  (op  +  2g)A  +  (hpj  “  9*  “  <*P9  ”  «/)I  (41) 


We  can  now  choose  p  and  q  so  that  the  coefficients  are  constants. 
One  convenient  class  of  solutions  is  obtained  by  setting 


9  ■ 


(42) 


Then 


A*  +  cl  -  0 


(43) 
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where 


c  ■  “6p*  9*  i^PQ  */ 

■  “  —  (a*  +  46)p*  +  «/ 

4 

»  +  (p*/4a)  +  «/  (44) 

and 

p  ■  {4a(c  “  n/')}*'^*  .  (45) 

For  an  arbitrary  constant  c,  then,  Eqs.  (45)  and  (42)  determine  a 
p  and  q  such  that  A,  from  Eq.  (26),  has  constant  eigenvalues  and  is 
expandable  in  the  form  of  Eq.  (7). 

3.  R  Isomorphic  to  the  Rotation  Group 

We  shall  now  consider  the  problem  in  a  still  more  abstract  manner 
so  as  to  obtain  greater  generality.  Specifically,  we  shall  find  that  we 
can  obtain  these  same  results  if  R  is  embedded  in  a  representation  of 
the  rotation  group. 

We  shall  not  here  discuss  in  detail  what  is  the  significance  of 
requiring  that  R  be  embedded  in  a  representation  of  the  rotation  group. 

For  this,  reference  may  be  made  to  Interim  Report  3,  Sec.  I.  We  shall 
only  assert  the  requirement  that  there  shall  exist  the  set  of  infinites* 
imal  transformations,  Mj ,  Mj ,  and  Mj  such  that  R  is  expandable  in  terms 
of  them: 

R  ■  7,M,  +  rjMj  +  rjMj  (46) 

where  M| ,  Mj ,  and  arf  constant  so  that  tlie  entire  z*dependence  of  R 
is  contained  in  the  scalars  and  y^.  We  require  also  that  these 

infinitesimal  transformation  be  such  that  they  obey  the  commutation  rules. 


(M, 

M,] 

-  HM, 

[M, 

M,] 

-  M 
"2 

[M, 

M,] 

-  -2M, 

(47) 
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We  cen  also  add  to  Eq.  (46)  any  constant  matrix  without  changing 
what  follows  to  any  important  degree,  although  it  does  complicate  the 
procedures  necessary  to  obtain  an  A  with  constant  eigenvalues.  (See 
the  Appendix  of  Interim  Report  3  for  the  elimination  of  auch  terms. ) 

It  should  be  noted  that  the  requirement  that  R  be  embedded  in 
a  representation  of  the  rotation  group  includes  cases  in  which  R  is  of 
any  desired  dimensionality.  As  discussed  in  Interim  Report  3,  Sec.  I, 
there  are  representations  of  the  rotation  group  involving  n  ^  n  matrices, 
where  n  is  any  positive  integer.  The  requirement  does  imply,  as 
indicated  by  Eq.  (46),  that  the  z-dependence  of  R  shall  not  require 
more  than  three  independent  functions  of  z ,  but  this  is  all. 

Since  the  M's  of  Eq.  (46)  are  constant,  we  have  from  Eq.  (46) 

that 


dz 


8,M, 


(48) 


where 


”  dyjdz 


(49) 


Our  natural  course  of  action  at  this  point  would  be  to  determine 
how  we  may  assure  that  R  has  constant  eigenvalues.  It  is  not  evident 
how  to  do  this,  however,  and  we  will  have  to  obtain  this  condition 
indi rectly . 

We  assume,  now  that  A  is  also  expressible  in  the  M’s, 


A  .  ttjM,  +  OjMj  +  o,Mj 


(50) 


TThen  we  find  that 

[A,  R]  -  (n[j7j  -  Ojr,)[M, ,  Mj]  Aklaiy,  -  0j7,)[Mj,  M,] 

+  (a,yj  -  «jr2)[Rz. 

-  -2(aj7j  -  )M,  -  (a,7,  -  o,r,)M, 

+  ((ijr,  -  a,r2)Mj  (51) 
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or  that 


Sj  •  +  «,n) 

S,  -  +  2a^y^)  .  (52) 

We  find  from  Eq.  (52),  that 

+  Sj7,  -  ^  Sj-Vj  -  0  .  (53) 

Equation  (53)  is  the  necessary  and  sufficient  condition  on  R  for 
Eq.  (52)  to  have  a  non-trivial  solution  for  the  S’s.  It  is  therefore 
the  necessary  and  sufficient  condition  that  dR/dr  be  expressible  as  a 
commutator  with  an  A  expressible  as  Eq.  (50),  and  this,  therefore,  is 
a  sufficient  condition  for  R  to  have  constant  eigenvalues. 

Substituting  Eq.  (49)  and  integrating,  we  find  that  Eq.  (53)  re* 
quires  that 

yl  -  4,7 ^y^  •  K  (54) 

where  t]  is  constant.  Equation  (54)  is  also  sufficient  to  assure  that 
the  eigenvalues  of  R  are  constant. 

We  have  shown  that  if  Eq.  (54)  is  satisfied,  a  solution  of  Eq.  (52) 
for  the  a's  exists;  the  solution  is  not  unique,  however.  One  solution, 
as  can  be  easily  verified  by  substitution,  is  given  by: 

"  -jK-Sir,  -  »,7,)  (55) 

-  -;*(», 7,  -  8,7,)  (56) 

a®  -  -;*(-28ir,  +  28,7,)  (.57) 

where 

fc  -  --  .  (58) 

K 
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Then 


Aj  -  aJM,  +  aJlUj  +  ajM,  (59) 

is  a  particular  aolution,  although  not,  in  general,  one  with  constant 
eigenvalues. 

The  matrix  A  will  now  be  expandable  as  Eq.  (7)  if  its  coefficients 
satisfy  the  equivalent  of  Eq.  (54): 

a*  -  4ajaj  ■  fi 

where  is  constant. 

For  this  purpose,  we  let 


a,  ”  +  Sjr,)  +  p^i 

aj  «  (y/<)(Sj-yj  -  Sj^j)  +  pyj 

■  (y//<)(-2S,yj  +  2527,)  +  py,  .  (61) 


That  is,  we  let  Abe  (Ao  +  pR)  where  p  is  a  scalar  function  of  z,  to 
be  determined.  The  addition  of  pR  to  A|j  does  not  affect  the  commutator. 

We  note  that 


(-25,72  +  2527,)^  -  4(-5,73  +  537,)(5273  "  5373) 

1  2 

=  -(5*  -  45,62)(73*  -47,72)  +  ^ 

-  -k(52  -  45,53)  (62) 


so  that 


•273(-25,72  +  2537,) 


472(-5,7j  +  i>37,)  -  47,(5373  -  5373)  -  0  . 


(63) 
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Hence 


«s  -  4aia,  (1A)(8|  -  4Si5,)  +  Kp*  (64) 

and  is  constant  if  we  aet 

p  -  {(mA)  '  (S*  -  4S,S,)A»}»/*  .  (65) 

With  this  function,  A  has  eigenvalues  and  its  derivative  is  ex> 
preasible  in  the  form  of  Eq.  (7).  Since  A  is,  now,  expressed  as  an 
isomorphism  of  the  rotation  group,  the  process  can  be  continued. 

The  only  limiting  condition,  other  than  those  already  stated,  is 
that  K  shall  not  anywhere  vanish. 

4.  R  Expanded  in  Pauli  Spin  Matrices 

It  may  also  be  useful,  at  times,  to  have  these  results  expressed 
in  terms  of  the  Pauli  spin  matrices.  These  may  be  taken  as  the  matrices: 


The  particular  form  of  Eq.  (66)  is  not  significant  for  our  purposes 
here.  What  is  significant  is  the  commutation  rules. 


[ff,. 

•  -2ya, 

[w  2  , 

•  -2/», 

^•3- 

-  -2ja^ 

•  [wj,  ■  [»j,  ■  0 

(67) 

For  a  diacussion  of  the  Pauli  apin  operatora,  the  reader  ahould 
conault  any  text  on  quantum  mechanica,  auch  aa  Dicke  and  Wittke,  Introduc¬ 
tion  to  Quantum  Mechanics  (Addiaon*Wealey  Publiahing  Company,  Inc.,  1960), 
Chapter  12.  The  definitiona  of  theae  matricea  aometimea  rary  in  the  aigna 
that  are  uaed,  ao  that  the  reader  ahould  take  care  to  note  the  particular 
definition  being  uaed.  The  reader  ahould  alao  note  that  the  defined 
in  Eq.  (66)  ia  not  properly  a  apin  matrix,  but  is  the  identity  element. 

Its  inclusion,  however,  makes  the  set  complete,  allowing  the  repreaenta- 
tion  of  an  arbitrary  2^2  matrix  in  the  form 

R  •  rj»j  +  TjWj  +  TjOj  +  r^o^  (68) 

where  the  coefficients  can  be  determined  by  the  orthogonality  relations 

trace  (o^^y)  ■  2S.^. 

ao  that 

1 

r  .  ■  —  trace  (a  H) 

*  2  ‘ 

where  the  trace  of  a  matrix  is  the  sum  of  the  diagonal  elements.  (This 
procedure  is  based  on  the  inner  product  relation  defined  in  Appendix  B, 

Sec.  3). 

It  may  also  be  observed  that  it  is  possible  to  define  a  similar 
set  of  operators  for  any  representation  of  the  rotation  group.  That  is, 
we  can  define  matrices  Oj,  Oj’  Buch  that  the  commutation  relations 

of  Eq.  (67)  are  obtained.  These  matrices  then  provide  a  basis  in  terms 
of  which  we  can  expand  any  R  that  is  embedded  in  this  representation. 

Hence  the  procedures  used  here  are  not  limited  to  2  ^  2  matricea.  The 
development  that  follows  is  applicable  to  any  system  for  which  the  develop¬ 
ment  of  the  preceeding  section  is  applicable.  The  two  analyses  are 
completely  equivalent,  although  there  may  be  considerable  difference 
of  convenience. 

We  consider  now,  an  R  that  is  expressed  in  the  form  of  Eq.  (68). 

Since  the  m  are  constant,  the  derivative  of  R  is; 

R*  ■  rjmj  +  TjOj  +  rjoj  +  (69) 
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We  seek  an  A  of 

the  form 

A 

-  Ujffj  +  a,#j  +  Ojffj  +  a^a^ 

(70) 

1  such  that 

i 

R*  -  -j[A. 

F]  - 

-/(ojr,  -  a,ri)[ai,  Oj] 

-  a,r,)[aj,  o,] 

-yfojrj  -  OjTj)  [sTj,  a  j] 

m 

“2(ajrj  -  a^ry)a^ 

+  2(a,rj  -  OsTjlaj 

“  2(ajrj  - 

(71) 

Hence,  we  must  have 

■  2(ajrj  -  Ojr,) 

1 

( 

'"2 

-  2(a,r,  -  o,r,) 

f 

i 

'•i 

-  2(a^r^  -  o,r,) 

(72) 

1  Furthermore,  must  vanish,  so  that  can,  at  most, 

For  this  to  have  a  non-trivial  solution  for  Oj,  Uj,  Oj,  we 

be  constant. 

must  have 

1 

*  '■2''J  ^  ■  0 

(73) 

1 

1 

1 

r 

f  +  r*  +  r»  -  /l» 

(74) 

1 

I  where  ^4  is  a  constant,  not 

I 

necessarily  real. 

1 
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We  try 


o,  ,  ikir^r^  ~  rjTj) 

■  2k(r^r[  “  r,r{) 

flj  ■  2fc(rjrJ  ~  '■j'"!) 


(75) 


obtained  from  Eq.  (72)  by  interchanging  and  r'  and  multiplying  by 
k.  Substituting  in  Eq.  (72)  and  using  Eq.  (63),  we  find  that  this  is 
a  solution  for  A  providing  k  •  ~l/(4i4^).  Hence  the  “general"  solution  is 


A 


-1 


-  ('‘2 


^  I  ^ 


2A^ 


+ 


{ 


-1 

24* 


(r,i 


+  All  <^1 


(76) 


For  A*  to  be  expandable  as  in  Eq.  (7)  —  t.e.,  as  a  commutator<>-the 
coefficients  of  A  must  satisfy  an  equation  similar  to  Eq.  (74).  Using 
Eqs.  (73)  and  (74),  we  find  that 


(rjr;  - 

-  4»(r;* 


'■3''2)‘ 


Aj 


r  3 


(r,r;  -  Tjr 


;>* 


•'*  +  r'*)  - 


(r,r 


1'  I 


+  r^rj  +  r 


AH, 


'2j 


.*2 


.'*1 


(77) 


Also 


~  ^3''2^  '■2A2'‘i  “  '■l^'l)  '’lAiTj  “  TjrJ) 


(78) 
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Hence  the  condition  thet  A  be  expandable  requirea  that 


—  («■!*  +r;»)  +  dV*  -  B*  (79) 

44*  *  ’ 

so  that  /  is  determined. 

It  should  be  noticed  that  we  have  used  only  the  commutation  relations 
of  Eq.  (67).  Hence,  the  results  are  not  limited  to  the  specific  form  of 
Eq.  (66).  Further,  it  is  not  even  limited  to  the  dimensionality  of 
Eq.  (66).  Specifically,  if  we  are  given  a  set  ,  Mj ,  and  Mjthat  obey 
Eq.  (47),  then  it  will  be  seen  that 

•  Mj  “  Mj 

•  j (M,  +  Mj ) 

Oj  »  2M, 

'  I  (80) 

have  the  commutation  rules  of  Eq.  (67).  Hence,  any  system  that  is 
isomorphic  to  the  rotation  group  can  be  handled  in  this  formalism. 

5.  Conclusions 

We  have  here  developed  the  analytic  technique  for  finding  an  A 
that  is  expandable  as  Eq.  (7),  given  an  R  that  is  expandable  as  Eq.  (6). 

We  have  done  this  for  various  types  of  systems  which  describe  the  effect 
of  coupling  modes,  or  systems  of  modes,  together  by  pairs. 

The  three  cases  considered  are:  (1)  /?  a  general  2^2  matrix, 

(2)  R  expressible  as  a  representation  of  the  three-dimensional  rotation 
group,  and  (3)  R  expressed  in  terms  of  either  the  Pauli  or  the  general¬ 
ized  spin  matrices.  Cases  (2)  and  (3)  are  equivalent  problems,  being 
merely  different  representations,  and  Case  (1)  is  a  subclass  of  the 
others.  They  have  been  given  in  detail  since  one  or  another  of  the 
solutions  may  be  most  convenient  in  different  situations,  and  in  the 
hope  that  general  insight  will  be  gained  by  studying  these  cases  in 
detail . 
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C.  AN  EXPLICIT  SOLUTION  FOR  A  FOR  R  OF  SIMPLE  STRUCTURE 


In  this  section  we  shell  seek  e  solution  of  Eq.  (2)  for  A,  given  R 
end  R' ,  that  will  involve  only  R  and  R'and  the  eigenvaluea  of  R.  That  is, 
we  shall  seek  to  avoid  explicit  use  of  the  eigenvectors  of  R.  The  reasons 
for  seeking  such  a  solution  have  been  outlined  in  the  introduction. 

We  shall  limit  ourselves  here  to  R  of  simple  structure.  That  is, 
we  shall  assume  R  to  have  a  complete  set  of  eigenvectors,  so  that  we 
do  not  need  to  use  generalized  eigenvectors.  This  does  not  appear  to 
be  a  serious  restriction  since  most  practical  systems  do  involve  a 
system  matrix  that  is  of  simple  structure  except  under  certain  limiting 
conditions  such  as  a  band-pass  filter  on  the  edge  of  the  passband, 
or  a  TWT  at  the  critical  degree  of  asynchronism  where  amplification 
is  lost. 

A  more  general  expression  which  does  not  require  R  to  be  of  simple 
structure  is  developed  in  Appendix  B,  based  on  the  abstract  analysis 
of  Appendix  A.  Hence,  even  this  restriction  can  be  avoided. 

We  have  the  problem  then  of  solving  Eq.  (2)  for  A.  We  note  that 
Eq.  (2)  is  of  the  form 

V  -  ;[R,  U]  .  (81) 

We  can  consider  y [R,  ]  as  an  operator  that  maps  U  onto  V.  We  wish 
to  find  an  operator  tliat  will  map  Y  onto  U.  That  is,  we  wish  to  find 
the  inverse  operator. 

The  operator  y [R,  ],  however,  is  singular.  It  "  annihi lates"  any 
U  that  commutes  with  R — e.g.,  R  itself.  Therefore,  it  has  no  inverse. 

It  has  no  inverse  since  the  range,  .Sj ,  of  V,  is  a  proper  subspace, ^  and 
not  the  whole  space,  S,  of  n  ^  n  matrices. 

What  we  need  is  not  a  true  inverse,  because  V  must  be  in  S j ,  and 
not  .S.  What  we  need  is  an  operator,  whose  domain^  is  .Sj ,  and  which 


2 

Hi*  ttt  of  eloMnta  of  tho  *hol«  apoco  for  vliicli  an  oparator  ia  dltfinad  ia  callod  tha  **doiRaiM**  of 
tha  oparator#  The  apace  apaaaad  by  Cha  raaalt  of  applyiag  an  oparator  to  any  a|a»ant  within  ita 
doaiain  ia  tha '*ranfa**  of  tha  oparator* 
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acts  as  an  inverse  within  this  domain.  What  it  does  to  a  V  outside 
this  domain  is  of  no  significance  since  V  is  necessarily  within 


The  problem,  then,  is  to  determine  a  “  restricted  inverse”  to 
7  [R,  ],  whose  domain  is  the  range  of  7 [R,  ].  This  operator,  then,  will 
not  determine  all  1)  that  give  a  possible  V,  but  instead  will  give  one 
possible  U.  It  is  in  this  sense  that  it  is  an  inverse  operator. 


V'e  note  that  7  [R,  ]  is  a  linear  operator;  that  is,  if  and  kj  are 
any  two  scalars  and  lij  and  tij  are  any  two  matrices  in  the  range  of  7  [R,  ]  , 


7[R,  (fejU,  +  jfejUj)]  -  *j7[R,  Uj]  +  Uj] 


(82) 


Therefore,  7  [R,  ]  has  a  minimum  polynomial.  That  is,  if  T  is  the 
operator. 


[R,  ] 


(83) 


there  exist  polynomials  in  T,  <^(T) , 


such  that 


0(T)U  »  0  (84) 

for  all  U.  The  polynomial  of  lowest  degree  is,  then,  the  "minimum 
polynomial . " 


To  interpret  Eq,  (84),  we  must  understand  what  is  meant  by 
power  of  T.  Clearly  this  involves  the  successive  application  o 
leading  to  the  ^-commutators.  That  is,  we  define 

a 

f  T, 

[R,  U]  = 

HU  -  UR 

[jR,  U]  . 

[R,  IR,  U]] 

[*R,  U]  . 

[R[^.,R,  UJ] 

(S.'i) 

3 

This  ststcaent  of  the  problea,  end  soac  of  the  erfuaent  that  folloea,  iaply  thst  it  is  possible  to 
consider  the  set  of  ell  nXn  astrices  es  a  linear  wactor  apacts  This  is  a  valid  vieepointi  althoefh 
aoaevhat  annsoal*  It  depends  upon  the  fact  that  the  set  of  n  X  n  astrices  obey  the  postulates  of  an 
abstract  linear  vector  spacsg  for  which  the  reader  should  consult  any  text  on  abstract  linear  alfebra, 
such  as  Linear  Algtbrm  end  Ifetrix  Theory*  by  Stoll,  (McGraw-Hill  Rook  Co*,  New  York,  N,y,,  1952) 

Sec,  2.2  et,  scf.  In  Appendix  B,  we  shall  exhibit  an  isoaorphisa  between  certain  classes  of  R  and 
particular  vector  spaces  that  are  expressed  in  the  usual  fora  as  a  coluan  aatrtx. 
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and  interpret  T*(U)  as 


T‘(U)  -yMjR.  W  (86) 

In  the  sequence  of  ii •commutators,  it  is  customary  to  define 

[jR.  U]  -  U 

so  that  T*’(U)would  equal  U. 

However,  Eq.  (84)  cannot  then  contain  T**  since  a  U  that  is 
annihilated  by  all  T*  with  k  ^  0— e.g.,  R — is  not  annihilated  by  T**. 
Hence  (^(At),  the  polynomial  function  of  ju  from  which  (^(T)  is  derived, 
cannot  contain  any  constant  term. 

We  will  suppose,  however,  that  does  contain  a  term  in  fi,  and 

will  '^normalize'*  the  polynomial  so  that  the  coefficient  of  this  term  is 
minus  unity.  The  significance  of  this  assumption  will  be  investigated 
later.  For  the  moment  we  will  simply  assert  that  it  appears  to  be 
equivalent  to  the  assertion  that  R  is  of  simple  structure. 

We  will  also  assert  that  contains  only  odd  powers  of  /u.  This 

we  will  later  show  to  be  true. 

It  follows,  then,  that  <t>  can  be  written  as 

0(m)  *  {^’/'(m)  "  1}m  (87) 

or  that 

^(T)T*X  -  W  (88) 


for  any  matrix  X. 

If  this  is  true,  then 


Uo  "  -/'(T)V 


(89) 


is  a  solution  of  Eq,  (81),  where  we  now  write  Eq.  (81)  in  the  form. 


V  -  TO 


(90) 
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We  can  rerify  thia  by  aubatituting  Eq.  (89)  in  Eq.  (90)  and  uaing 
Eq.  (88).  Since  V  is  not  annihilated  by  T,  for  R  of  ainple  atructure, 
we  can, for  this  operand,  cancel  a  T  out  of  Eq.  (88),  so  that  Eq.  (88) 
reduces  to 


i/'(T)iy  -  V  .  (91) 

Equation  (89)  then  gives  us  the  particular  U,  Ug,  for  a  given  Y, 
that  is  within  the  range  of  T.  However,  any  U  whose  projection  on  the 
range  is  U|^>  is  also  a  solution.  We  may  write,  as  the  general  solution, 

U  -  Vo  +  i(z,  R)  (92) 

where  ^(z,  x)  is  any  suitable  function  of  x  and  z.  [it  is  suitable, 
again,  if,  for  any  value  of  z,  ^(z,  x)  is  analytic  in  x  in  a  simply 
connected  region  that  includes  all  the  eigenvalues  of  R.]  Since  the 
added  factor  is  annihilated  by  T,  the  solution  remains  valid. 

[The  particular  form  of  Eq.  (92)  may  not  be  the  most  general  possible 
solution  if  R  is  degenerate.  It  appears  to  be  sufficiently  general  for 
our  purposes,  however.] 

1.  The  Determination  of  4> 

We  must  now  investigate  the  significance  of  the  polynomial  (pin). 

We  will  suppose,  throughout  this  section  that  R  is  of  simple 
structure.  We  do  not  require  that  it  be  non-degenerate,  so  that  some 
of  its  eigenvalues  may  be  multiple,  but  we  do  require  that  there  be  a 
complete  set  of  linearly  independent  eigenvectors.  Suppose  these  are 
the  set  such  that 


R»..  »  X.*, 


(93) 


The  operation  of  any  matrix  I)  on  x^  yields  a  vector  that  can,  in 
turn,  be  expanded  in  terms  of  the  set 


(94) 


S4 


If  we  consider  the  operation  of  the  commutator  on  x^,  we  find 
[R.  V]x.  -  (HU  -  UR)*. 


-  R  I  u,  .*.  -  X.U*. 


U>^ju..x.  -  K.u.jx.) 

-  X.)u. .*,  . 


We  may  guess  that  the  effect  of  the  p-commutator  on  x 
[,R.  U]*.  -  2(\^.  -  \  .)Pu..x^  . 


■  is: 


(95) 


This  we  can  prove  by  induction.  It  is  true  for  p  =  1.  Assume  it  is 
true  for  p.  Then  we  find  that 

[^*iR.  U]*.  .  R[^R,  U]*.  -  [^R,  U]R*. 

-  .*  “ 

-  I(S  . 

Hence  Eq.  (95)  is  proven.  Therefore,  we  find  that,  for  a  polynomial, 
0(m)  in  pt: 


<P(M)  =  Oj  +  oj/x  +  a^pi*  +  ... 

0(T)U*.  -  Ij'a^[^R,  Ujx. 


If,  now,  we  set 
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(96) 


(97) 


(98) 


then 


(99) 


This  will  vanish  for  all  x. providing  is  a  polynomial  whose 

roots  are  the  set  It  will,  furthermore,  be  the  minimum  polynomial 

if  its  roots  are  simple,  so  that  it  has  the  form 


0(/ii)  -  ) 

»/  ’ 


(100) 


where  i,  j  take  only  such  values  that  the  set  is  distinct. 

We  note  that  ■  0  so  that  the  polynomial  so  determined  has  no 

constant  term,  aa  required.  We  also  note  that  /J..  .  ■  so  that  for 

each  non-zero  roo*  there  is  another  that  is  its  negative.  lienee,  the 
non-zero  terms  generate  a  polynomial  in  even  powers  of  //.  Multiplying 
by  for  the  factor  corresponding  to  the  zero  root,  we  find  that 
contains  only  odd  powers  of  fx,  as  required. 


The 
non-zero 
Eq.  (87) 


coefficient  of  the  first  power  of  fx  is  the  product  of  all  the 
H’ij-  Hence,  the  minimum  polynomial  normalized  according  to 
is 


-n(^  ■  ) 

0(m)  >  -  (101) 

nv 

where  i,  j  take  values  such  that  each  possible  ^x^^  occurs  just  once, 
and  where  the  prime  on  the  product  symbol  in  the  denominator  indicates 
that  the  factor  ^x^^  ■  0  is  omitted  from  the  product. 

We  note  that  we  cannot  have  all  eigenvalues  of  R  the  same.  Since 
R  is  assumed  to  be  of  simple  structure,  it  would  then  be  a  scalar  func¬ 
tion  times  the  identity.  Hut  the  constancy  of  the  eigenvalues  would 
require  that  the  scalar  function  be  constant.  Hence,  the  whole  problem 
becomes  trivial.  Excluding  this  trivial  case,  there  must  exist  a  set 
of  non- zero  u .  . 

We  conclude,  then,  that  a  <P(fx)  of  the  form  specified  in  Rq.  (101) 
must  exist  for  any  R  of  simple  structure. 
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As  a  note  of  parenthetical  interest,  we  can  observe  that  0(/^)  is 
the  minimum  polynomial  of  the  matrix 

(jl  ^  H  -  jR  X  I)  (102) 

where  the  symbol  x  indicates  the  “Kronecker”  or  “direct"  product.* 

D.  THE  CANONICAL  TRANSFORMATIONS  OF  A  CHAIN  OF  EQUATIONS 

In  this  section  we  will  consider  the  solution  of  the  chain  of 
equations : 


— 

dz 

dz 


0 

;[B.  A] 


dz 


;[C,  B] 


etc. 


A  -  A„ 

B(0)  -  Bj 

C(0)  -  Co 


(103) 


The  class  of  non-uniform  systems  described  by  such  a  chain  is  that 
considered  in  Sec.  I,  and  in  Interim  Report  2,  Sec.  II-A,  and  Interim 
Report  3,  Sec,  II.  This  class  is  of  great  interest  since  it  is  a  class 
that  exhibits  some  of  the  potentialities  of  non-uniformity,  but  is  at 
the  same  time  analyzable.  It  includes,  for  example,  the  important  class 
of  parametrically  coupled  systems. 

We  will  consider  here  the  solution  of  such  a  chain.  Specific 
solutions  for  the  simpler  cases  have  been  given  in  Interim  Report  3, 

Sec.  II-D.  We  shall  generalize  the  procedure  used  there  and  obtain  a 
process  that  can  be  repeatedly  applied  to  the  chain  to  effect  a  re¬ 
duction  of  its  complexity. 

It  should  be  noted  that,  in  Eq.  (103),  we  have  reversed  the  order 
of  the  chain  from  our  usual  designation.  The  matrix  A  is  the  one  that 


for  •xiwpre  Rs  Introduction  to  Matrix  Anxlytit,  (McGravHill  RcK)k  Co#,  lac#  Ne*  York, 

ffsYs,  1960)  Chaptor  12,  or  othar  ataadard  taxt* 
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ia  constant.  (We  are  not  concerned  here  with  chains  that  do  not 
terminate. )  The  matrix  B  is  the  one  which  can  be  regarded  as  represent¬ 
ing  a  simple  rotation  around  a  constant  axis.  The  matrix  C  then 
represents  a  rotation  around  an  axis  that  is  itself  rotating  about  a 
constant  axis.  And  so  on.  (This  geometric  interpretation  is  valid, 
but  it  should  be  remembered  that  the  vector  space  involved  may  have 
many  dimensions  and  a  not-positi ve-def inite  metric.  In  relativistic 
terms,  some  of  the  dimensions  may  be *' time- like. ”  Hence,  the  geometry 
of  the  space  may  be  quite  different  from  Euclidean  space.)  The  system 
operator,  in  our  usual  sense  of  this  term,  is  then  the  highest  member 
of  the  chain  in  which  we  are  interested. 

This  change  of  order  is  convenient  for  our  purposes  because  our 
procedure  will  involve  the  successive  isolation  and  elimination  of  the 
complexities  introduced  by  t)te  lower  members  of  the  chain.  We  shall, 
in  the  first  step,  change  the  basis  to  a  rotating  one,  such  that  B 
becomes  constant  and  C  becomes  a  description  of  a  simple  rotation  about 
a  fixed  axis.  Then  we  shall  make  a  second  change  of  basis  such  that 
C  becomes  constant.  And  so  on,  up  the  chain.  We  shall  finally  obtain 
a  set  of  equations  for  the  changes  of  basis  that  will  effect  these 
simplifications.  While  these  equations,  for  the  higher  terms,  are  not 
simple,  they  are  in  principle  soluble.  Thus,  we  shall  establish  a 
canonical  procedure  for  the  analysis  of  such  a  chain. 

j  The  only  restriction  that  we  shall  impose,  other  than  the  implied 

one  that  the  system  matrix  is  expandable  as  a  chain  of  finite  length,  is 
that  system  everywhere  exhibits  a  known  conservation  law.  As  discussed, 
for  example,  in  Technical  Note  1,  Appendix  A,  and  Interim  Report  3, 

Sec.  II-B,  this  requires  that  the  system  matrix,  R,  be  A-hermi tian 
(KR  -  R^K).  And,  as  shown  in  Interim  Report  3,  Sec.  II-C,  Theorem  3, 

i  we  can  then  choose  all  the  members  of  the  chain  so  that  they  are  every- 

I 

I  where  A'hermitian,  We  are  then  justified  in  assuming  that  all  the 

I  matrices  of  Eq.  (103)  are  everywhere  A-hermitian. 

! 

{  1.  Fibst  Step 

I. 

We  have  specified  the  chain  so  that  A  is  a  constant  matrix,  . 

We  can  use  this  fact  to  obtain  a  z-dependent  change  of  basis  that  will 
make  B  a  constant. 
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We  conaider  the  change  of  baaia  induced  by  a  tranaforaation  Uj 
that  ia  X-unitary: 


UflOUi  -  K  (104) 

Evidently  l)|  ia  non*aingular,  aince  K  ia,  and  ao  may  be  uaed  aa  the 
tranaformation  matrix  of  a  change  of  baaia: 

A,  -  Uj‘AU, 


B,  . 


etc. 

If,  now,  we  define 

dUj 

M.  - 

then  M|  is  f(>hermi tian ,  since 

K- ‘M|K 


dUj 

-;K** - 11} -‘K 

dz 


(105) 


(106) 


■  *•) 

dU, 

dV, 

lU  *  • 


(107) 


Furthermore,  Bj ,  Cj ,  etc.,  are  X>hermi tian,  since  B  is  Af-hermi tian ,  and 

K-'BtK  -  K-'UlB^Uj-'K 


-  B;'E-*Bt|(IJ,  -  Ul‘BU,  .  B, 


(108) 
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Now  differentiating  the  expreseion  for  Bj 


in  Eq.  (lOS),  we  find 


dB| 

dt 


dl]|  dll,  dB 


-  jMjBj  “  /B|Mj  +  j  [Bj ,  Aj  ] 

-  i  iBj,  (Aj  -  Mj)] 


(109) 


Similarly,  we  have  that 


dCj 

■—  •  j  [C, ,  (Bj  -  Mj)] 
dz 

dDj 

— -  -  (C,  -  M, )] 

dz 


etc.  (110) 


That  is,  if  the  original  sequence  was  K-hermitian,  and  if  we  choose 
IJj  to  be  X'Unitary,  then  the  new  sequence,  Aj ,  Dj ,  Cj ,  etc.,  are  still 
/(•hermitian. 

We  were  given  that  A  was  constant.  If,  then,  we  set 

-  Aj  "  A,U,(0)  -  I  (111) 


then  Eq.  (109)  reduces  to  dBj/dz  »  0,  and  Bj  is  constant.  To  accomplish 
this,  we  require  that 


dU. 

-  «;‘AUj  .  Ul'A,Uj  (112) 


dU, 

~  -  -7A,U,  (113) 

U,  -  exp  {-jK^z) 

4« 


(lU) 


Furthermore,  this  Uj  im  indeed  /f-unitery  since  Ag  is  A-hermitisn. 
It  is  also  non-singular  for  any  A^. 

We  have,  then,  replaced  the  original  X-hermitian  sequence  by  a 
new  X-hermitian  sequence  in  which  B|  is  now  constant.  Furtheremore, 
since  the  transformation  is  the  identity  one  at  z  ■  0,  we  must  have 


B.  -  Bo 


(115) 


We  can  now  continue  the  process. 


2.  Second  Step 


We  consider,  as  before  a  X-unitary  transformation  matrix  Vj  and 
define  the  X-hermitian  matrix  Nj  as 


dVj 

N,  -  JV-  —  . 


(116) 


We  now  let 


B  »  V'*B  V 
“a  ’a  "I’a 


C,  .  V;>C,V, 

etc. 


(117) 


and 


M, 


Vi%V, 


(118) 


Then 


dz 


dV.  dV,  dC, 

,  -Va.  — 

.  (Bi  -  lil,)3V, 


>[C,.(B,  -  -  Nj)] 
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(119) 


I 

f 


j[Dj,  (Cj  -  Mj  -  Nj)]  etc. 


(120) 


We  choose  Vj  so  thst 


Nj  -  B,  -  Mj 


B,  -  A, 


Vj{0)  -  I 


(121) 


.V.i_  .  .  V->(B,  -  A,)V, 


->(B,  -M,)V,  .  ->(B„-A,)V„ 


(122) 


Vj  »  exp  {-j(Bj  -  A,)2} 


(123) 


This  choice  makes  Cj  a  constant.  At  2  ■  0,  both  Vj  and  Uj  are 
the  identity,  so  the  constant  must  be 


3.  Third  Step 


(124) 


Continuing,  we  set 


s  “ 


(125) 


^3  =  ^*3%^ 


(126) 


and  Dj  will  be  a  constant  providing  we  set 


P,  •  C,  -  Mj  -  Nj 


(127) 


I 


or 


~  .  -;  (C,  -M,  -N,)W, 

-  -j  (C,  -B,)W, 

-  ->(Co  -  V;‘BjV,)W,  .  (128) 

4.  The  TRANsroRHATioN  Equations 


The  key  equations  can,  then,  be  summarized  as  follows: 


dU, 

m 

dz 

u,(0)  -  I 

(129) 

dl 

- 

v,(0)  -  I 

(130) 

II 

-;{c,  - 

w,(0)  -  I 

(131) 

II 

-;<D,  - 

w;‘c.w,)X. 

X,(0)  -  I 

(132) 

etc. 

It  happens  that,  because  of  the  simplicity  of  the  first  equation, 

Uj  commutes  with  and  the  second  equation  is  simple.  This  is  not  true 
in  the  succeeding  equations,  however.  Their  complexity  grows  very 
rapidly. 

If  we  write  the  general  equation  of  this  sequence  of  Eqs.  (129), 
(130),  (131),  (132),  etc.  as 

~  -  -/{F.  -  S'*C,S}R  (133) 

ti  9  ^  * 
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I 


we  can  wake  the  aubatitution 

R  -  e"''®*  R  (134) 

since  is  conatanti  and  obtain 

—  jS-‘6.SR  (135) 

dx  " 

where 

S  -  .  (136) 

Hence,  the  entire  ayatem  can  be  reduced  to  the  form  of  Eq.  (135). 
It  ia  theae  equationa  for  the  appropriate  baais'tranaformationa  that  we 
conaider  the  canonical  equationa  for  the  original  chain.  The  aolution 
of  the  original  chain  haa  been  made  to  depend  on  the  aolution  to  the 
canonical  chain  of  equationa  of  the  form  of  Eq.  (135). 
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Ill  PROGRAII  FOB  THE  NEXT  INTERTAL 


During  the  next  interval  work  will  continue  on  the  problem!  atudied 
here.  It  is  thought  that  we  are  now  in  a  poaition  to  begin  the  inter¬ 
pretation  of  some  of  these  results  in  terms  of  physical  processes  that 
are  of  interest  for  devices. 

One  tool  for  this  process  is  the  "  Baker-Hausdorf f  Formula.”  This 
is  a  formula  giving  the  solution  of 

exp  C  *  (exp  4)(exp  B) 

for  C  when  A  and  B  are  non-commuting  operators.  Since  the  class  of 
non-uniform  systems  that  we  have  been  considering  are  solved  as  the 
product  of  exponentials,  the  determination  of  the  behavior  of  the  system 
as  a  whole  can  be  investigated  conveniently  with  this  formula. 

We  intend,  also,  to  begin  consideration  of  ways  of  developing  a 
statistical  theory  of  mode  coupling.  This  is  a  new  topic,  but  one  that 
is  important  for  many  practical  problems. 

Finally,  surveillance  of  the  literature  will  be  continued.  No 
report  on  this  aspect  has  been  included  in  the  present  report,  but  will 
be  continued  in  the  next  Interim  Heport. 
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APPESbIX  A 


THE  “INVERSION"  OF  SINGIIAR  OPERATORS 
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APPENDIX  A 


TK  INVERSION'*  OF  SINGULAR  OPERATORS 


In  this  appendix  we  ahall  be  concerned  with  the  solution  for  Y  of 
the  equation 

B  -  Tv  (A-l) 

when  u  is  known.  T  is  assumed  to  be  a  linear  operator  which  may  be 
singular. 

A  singular  operator,  of  course,  has  no  inverse.  This  is  whst  we 
mean  by  calling  it  singular.  Nevertheless,  Eq.  (A-l)  is  soluble  in  the 
sense  that,  for  a  given  u,  v  is  determined  within  certain  limits. 

The  singularity  of  T  enters  through  the  statement  that  not  all  a 
will  generate  an  appropriate  r.  However,  this  does  not  contradict  the 
problem  since  the  statement  of  the  problem  implies  that  u  is  in  the  range 
of  T.  That  is,  if  we  can  write  Eq.  (A*l)  at  all,  then  u  must  be  in  the 
subspace  such  that  some  v  exists.  The  problem,  therefore,  is  to  find  an 
operator  that  acts  as  an  inverse  for  T  when  operating  on  the  range  of  T— 
t.e.,on  the  subspace  that  is  obtained  by  operating  on  all  vectors  with  T. 
We  do  not  care  what  this  operator  may  do  on  any  other  vector. 

The  problem  may  arise  in  a  number  of  different  contexts.  In  vector 
analysis,  we  may  meet  the  problem  of  finding  v,  given  u,  when 

u  *  a  X  V  .  (A- 2) 

The  operator  (a  x)  is  easily  seen  to  be  a  linear  operator.  It  is  s 
singular  one  since  the  cross  product  vanishes  if  v  is  a  scalar  times  a. 
And  yet  it  msy  be  important  to  be  able  to  solve  such  an  equation  for  v. 

Another  problem  that  is  of  considerable  interest  is  to  find  the 
matrix  V,  given  U,  when 

V  •  ytR.U]  •  ;(RU-IR)  (A-S) 
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t.«.,  «h«B  V  ia  the  eoaautator  of  R  and  1).  The  apace  of  n  ^  n  aatricea 
can  be  conaidered  a  linear  vector  apace.  Thia  aay  aeea  aoaewhat  atrange, 
but  it  can  be  eaaily  aeen  that  thia  apace  doea  obey  the  poatulatea  of  a 
linear  vector  apace.  The  operator 

T  -  >»,  ]  (A.4) 


ia  a  linear  operator.  It  ia,  however,  a  aingular  one  aince  V  vaniahea 
for  any  U  that  coamutea  with  R,  auch  aa  R  itaelf. 

Thia  form  ariaea  in  varioua  quantum  mechanical  problems.  It  alao 
occura  in  coupled  mode  theory.  The  claaa  of  ayatema  with  ayatem  matrix, 
R,  for  which  a  matrix  A  exiata  auch  that 


dt 


;tR,  A] 


(A-S) 


ia  a  claaa  of  **non*uniform"  ayatema  which  are  cunaiderably  simpler  than 
the  general  non-uniform  ayatem.  If  A  can  be  choaen  to  be  constant,  it 
is  then  possible  to  solve  the  system  exactly  and  explicitly.  This  class, 
in  fact,  includea  most  of  the  solved  non-uniform  systems,  such  as  the 
exponentially  tapered  tranamiasion  line  and  the  distributed  parametric 
amplifier.  Thia  ia,  then,  a  class  of  systems  that  justifies  intensive 
study. 

In  the  study  of  Eq.  (A- 5),  our  immediate  concern  is  to  determine  if 
the  system  operator  R(2)  is  governed  by  an  equation  of  the  form  of 
Eq.  (A-S).  It  can  be  shown  that  the  necessary  and  sufficient  condition 
for  thia  ia  that  the  eigenvalues  and  structure  of  R  be  constant. 

Having  determined  that  this  ia  tfue,  our  next  concern  is  to  determine 
what  A  may  be,  knowin;;  R(z)  and  hence  dR/dz.  This,  then,  is  the  problem 
of  " inverting"  the  singular  operator  j  [R,  ], 

We  note  that  we  do  not  expect  a  unique  solution  to  Eq.  (A-Sl.  Given 
an  A  that  satiafies  Eq.  (A-5),  we  can,  evidently,  add  to  A  any  matrix 
function  of  z  that  everywhere  commutes  with  R(z)  without  affecting 
Eq.  (A-5).  There  is,  then,  a  "pencil"  of  solutions. 

In  more  abstract  terms,  given  Eq.  (A-1)  with  T  a  linear  operator.  If 
V  i a  any  solution  to  Eq.  (A-l)  for  a  given  ■,  and  if  i a  any  vector  such  that 


5t 


(A.6) 


Tto  •  0 

then  (v  OVg )  ia  a  solution,  a  being  any  scalar. 

Hence,  if  T ia  singular,  the  solution  to  Eq.  (A-1)  ia  not  unique. 

To  discuss  this  problem  we  shall  first  discuss  these  complications 
in  a  precise  way. 

1.  ABSTRACT  FORMULATION 

We  will  start  by  defining  certain  important  concepts. 

Definition:  The  “ domain,  ”  ,  of  an  operator,  T,  is  the  subspace 

of  vectors  such  that  the  operation  of  T  is  defined. 

Normally,  we  consider  the  domain  of  a  matrix  operator  to  be  the 
whole  apace.  We  do  this  because  there  is  really  no  reason  to  do  other¬ 
wise.  (This  is  not  true  in  infinitely  dimensional  spaces  with  either 
infinite  matrices  or  integral  or  differential  operators.  In  such  spaces, 
it  may  not  be  possible  to  apply  a  given  operator  to  any  operand  without 
introducing  divergencies.  It  is  then  necessary  to  restrict  the  domain  to 
assure  convergence.  In  finitely  dimensional  spaces,  such  as  is  our  con¬ 
cern,  this  problem  does  not  arise  so  that  we  usually  have  no  reason  to 
Specify  the  domain  as  other  than  the  whole  space. )  However,  in  the 
present  situation,  we  are  seeking  an  operator  whose  operand  is  known  to 
be  confined  to  a  given  subapace.  This  operator  will  have  significant 
meaning  only  for  the  domain  that  is  spanned  by  the  possible  vectors  ■. 

Definition:  The  "  range, ’’ Sjj,  of  an  operator,  T,  is  the  subspace 

spanned  by  the  vectors  obtained  by  T  operating  on  any  vector  in  its 
domain.  We  can  express  this  formally  by  writing 

S*  •  TS, 

or  by  saying  that  the  range  of  T  is  T  operating  on  its  domain. 

The  singularity  of  T  is  expressed  by  saying  that  its  range  is  a 
proper  subspace — i.e.,  is  not  the  whole  space. 

Definition:  The  "null  space, ’*  of  an  operator,  T,  is  the  subspace 
of  its  domain  such  that  T  operating  on  any  vector  in  vanishes: 

T*  ■  0  ifxisinSjy 

.*»! 


The  eingulerity  of  T  cen  alao  be  expreaaed  by  stating  that  the  null 
space  of  T  is  a  proper  aubapace— t. ,  not  the  space  containing  only  the 
null  vector. 

Dgfinition:  A  set  of  linearly  independent  vectors  Xj,  x^  are 

called  "progenitors  of  the  range  of  T"  if  the  set  TXj,  Txji  •••  form 
a  basis  for  the  range  of  T. 

Note  that  the  specification  that  Txj,  ...  Tx^  form  a  basis  implies 
that  (1)  they  are  linearly  independent,  and  (2)  none  is  the  null  vector. 
Hence  a  aet  of  progenitors  must  contain  exactly  k  vectors,  if  k  is  the 
dimensionality  of  the  range. 

Further,  the  vectors  Xj  ...  X^  must  be  linearly  independent.  For, 
if  there  existed  a  non  trivial  set  of  scalars  such  that 

-  0  (A-7) 

then,  since  T  is  linear,  we  would  have  that 

TZ^i*i  “  ‘  ®  tA-B) 

and  the  set  Tx^  would  not  be  linearly  independent. 

Hence,  any  set  of  progenitors  spans  a  subspace  which  we  will  call  a 
"progenitor  apace."  Tliis  space  is  not  uniquely  defined,  if  T  is  singular. 
It  has,  however,  the  virtue  that  the  operation  of  T  on  any  progenitor 
space  is  non-singular.  That  is,  to  every  vector  in  any  given  progenitor 
apace  there  corresponds  one  and  only  one  vector  in  the  range  of  T,  and 
vice  versa. 

In  solving  Fq.  (A-1),  then,  we  know  that  u  is  in  the  range  of  T.  We 
wish  to  specify,  for  each  U,  a  unique  v  that  is  a  particular  solution  to 

Eq.  (A-1).  That  is,  we  wish  to  specify  a  particular  progenitor  space. 

The  mapping  of  the  progenitor  space  onto  the  range  is,  then  one-to-one. 

If,  in  addition,  the  progenitor  space  can  be  taken  as  identical  to  the 
range,  then  this  mapping  can  be  expressed  as  a  non-singular  operator  with 

this  subspace  as  its  domain.  The  problem  is  then  directly  soluble,  giving 

a  particular  solution  to  Eq.  (A-l).  The  general  solution  is  then  obtained 
by  adding  to  the  particular  solution  any  vector  in  the  null  apace  of  T. 
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This  i«  not,  hovever,  nlvnyo  possible  to  do.  If  T  hss  s  chsin  of 
of  eigenvectors  end  genersliied  eigenvectors  with  lero  eigenvslue,  then 
the  eigenvector  thst  hesds  the  chsin  is  both  in  the  null  space  of  T  and 
in  the  range.  It  is  in  the  null  apace  since  it  is  annihilated  by  T.  It 
is  in  the  range  since  it  is  generated  by  the  generalised  eigenvector  of 
rank  two.  Since  it  ia  in  the  null  space,  it  is  not  by  itself  a  pro* 
genitor.  Therefore,  the  progenitor  space  cannot  be  identical  with  the 
range.  While  the  napping  of  a  progenitor  space  onto  the  range  is  still 
one*to*one,  it  ia  not  a  napping  of  a  subspace  onto  itself.  In  this  case, 
we  will  have  to  use  a  more  involved  procedure. 

Before  going  into  the  general  procedure,  however,  we  will  consider 
a  sinpler  case  in  which  this  problen  does  not  arise. 

2.  T  OF  SIMPLE  STRUCTURE 

We  will  consider  first  the  case  when  T  ia  of  sinple  structure.  This 
is  not  the  nost  general  situation  that  avoids  the  difficulty  cited  in  the 
previous  section.  We  could  admit  non-simple  structure  involving  chains 
with  non-tero  eigenvalues.  However,  it  does  appear  to  include  moat  situ¬ 
ations  of  interest.  The  remaining  situations  in  general  seem  always  to 
include  at  least  the  threat  that  a  zero-eigenvalued  chain  may  exist,  so 
that  it  is  then  better  to  use  the  more  general  procedure  that  we  will 
discuss  shortly. 

We  have,  then,  the  following  theorem 

Theorem  1:  If  T  is  of  simple  structure,  the  set  of  eigenvectors  with 
non- zero  eigenvalues  are  both  a  basis  for  the  range  and  a  set  of  pro¬ 
genitors — i.e,,  a  basis  for  a  progenitor  space. 

This  follows  directly  by  taking  as  a  basis  for  the  whole  space  the 
whole  set  of  eigenvectors,  which  is  complete  since  T  is  of  simple 
structure. 

This  theorem  can  also  be  stated  as: 

Corollary:  If  T  is  of  simple  structure,  the  range  of  T  ia  a  pro¬ 

genitor  space  of  T. 

Thus  this  tells  us  that,  in  Eq.  (A-1),  we  can  restrict  T  to  be 
within  the  range  of  T  without,  by  so  doing,  restricting  the  range  of  n. 
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We  can,  in  other  worda,  be  certain  that,  for  any  n  that  can  occur  in 
Eq.  (A'l) — i.e.,  ia  in  the  range  of  T  —there  exiata  an  X  auch  that 

u  -  T**  (A-9) 

Then,  the  particular  v,  that  ia  within  thia  progenitor  space  ia 
given  by 

V,  «  Tx  (A-10) 

The  minimum  polynomial  of  T — i.e.,  the  polynomial  function  that 
annihilates  any  vector  in  the  whole  space — can  be  written  as 

«I»(T)  -  n(T  -  X.I)  (A-11) 

where  the  product  is  taken  over  all  dietinct  values  of  That  is,  if 

k.  is  a  multiple  root,  the  factor  (T  ~  X^I)  appears  only  once.  That 
this  is  the  minimum  polynomial  follows  from  the  fact  that  T  is  of  simple 
structure. 

Since  T  is  assumed  singular,  zero  is  an  eigenvalue.  Hence  <l>  con¬ 
tains  as  a  single  factor,  T  itself,  and  can  be  written  as 

«i>(T)  =  «,T  +  +  .  .  .  (A.12) 

where 

Uj  =  n'(-\.)  (A-13) 

the  prime  indicating  that  the  product  is  over  all  distinct,  non-zero 
values  of  k^.  Hence,  Oj  is  not  zero,  and  <i>(T)  can  be  written  as 

•iid)  •  {oj  +  (Oj  +  OjT  +  ...)T}T 

-  a,{l  -  V/dim  U-14) 

where 

0(1-)  .  -  —  (a.  +  oj  +  ..  .  )  (A-15) 

a,  ^  * 
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The  function  4>(T)  ia  a  function  that  annihilatea  any  vector,  X. 
Hence,  for  any  X,  «e  have  that 

Tx  -  i/'(T)T*x  (A.16) 

If,  now,  we  premultiply  Eq.  (A-9)  by  i/'(T),  we  find 

(A(T)u  -  0(T)T*x  -  Tx  -  Vj  (A-17) 

so  that  the  particular  solution  is 

*  ^{T)u  (A-IS) 

and  the  general  solution  is 


V  *  i/'(T)u  +  V(j  (A-19) 

where  v^  is  any  vector  in  the  null  space  to  T — i.e,,  such  that 

Tvj  -  0  (A.20) 

Ve  have,  then,  obtained  a  general  solution  of  Eq.  (A-1)  if  T  is  of 
simple  structure. 

This  may  seem  like  a  devious  way  of  going  about  it.  In  particular, 
we  have  spent  a  considerable  amount  of  time  justifying  the  substitution 
of  Tx  for  v,  which  was  then  immediately  removed  from  Eq.  (A>17). 

This  was,  however,  necessary.  It  was  necessary  because  the  minimum 
polynomial  of  a  singular  operator  contains  no  constant  term.  Therefore, 
we  cannot  find  a  function  </'(T)  such  that,  in  general, 

X  «  i//(T)Tx  (A-21) 

The  best  we  can  do  is  Eq.  (A-16),  in  which  the  right  side  contains  the 
factor  T*.  The  manipulation  through  the  vector  x  defined  suc)i  that 
Tx  -  Vj,  is  necessary  to  take  account  of  this  factor. 

If  T  is  non-singular,  we  can  be  certain  that  the  minimum  polynomial 
does  contain  a  non-zero  conatant  term,  and  a  i/'(T)  does  exist  such  that 
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Eq.  (A>21)  is  true.  In  this  esse,  the  solution  to  Eq.  (A>1)  is  obtsined 
by  premultiplying  Eq.  (A>1)  by  this  ip(T).  This  is  a  valid,  if  somewhat 
unorthodox  way  of  obtaining  T**^  for  a  non>aingular  operator  in  terms  of 
the  positive  powers  of  T. 

If,  now,  T  ia  not  only  singular  but  has  a  chain  of  length  k  with 
zero  eigenvalue,  then  the  minimum  polynomial  contains  the  factor  T^. 

Hence,  instead  of  Eq.  (A*16),  we  obtain 

T*x  -  (A-22) 

which  does  not  permit  us  to  use  the  same  procedure.  We  could  use  it  if 
we  could  make  the  substitution 

V  =  T*X  (A-23) 

This,  however,  excludes  the  possibility  that  V,  and  hence  u,  are  in  the 
entire  subapace  spanned  by  the  chain  of  generalized  eigenvectors  with 
zero  eigenvalue.  This  subspace  does  include  vectors  that  are  in  the 
range  of  T,  as  discussed  before.  Therefore,  the  v  so  found  is  not  a 
solution  for  any  u,  but  for  a  specifically  restricted  u.  It  is  for  this 
reason  that  we  must  seek  a  more  general  approach  to  this  problem. 

3.  GENERAL  T 

V.'e  can  handle  a  general  operator,  T,  including  those  with  chains  of 
generalized  eigenvectors  with  zero  eigenvalue,  by  consideration  of  the 
adjoint  operator. 

Before  doing  so,  we  define  the  orthogonal  complement  of  a  subspace. 

Definition:  Given  a  (proper)  inner  product  relation,  the  “orthogonal 
complement”  of  a  subspace,  is  the  subspace  of  all  vectors  that  are 

orthogonal  to  every  vector  in  That  is,  X  is  in  the  orthogonal  com¬ 

plement  of  if 

<X.  y>  ■  0 

for  every  y  in  Sj. 
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The  definition  impliee  that  the  orthogonal  conplenent  of  a  aubapace 
is  a  aubapace.  It  is  not  difficult  to  see  that  this  is  so  (providing  the 
whole  space  haa  a  finite  number  of  dimenaiona). 

We  ahall,  then,  meke  uae  of  the  following  theorem. 

Theortm  2:  Given  a  (proper)  inner  product  relation,  the  range  of  T  is 
the  orthogonal  complement  of  the  null  apace  of  T*,  where  T*  ia  the  oper¬ 
ator  that  is  adjoint  to  T  under  the  given  inner  product  relation. 

Thia  theorem,  then,  gives  us  a  connection  between  the  range  of  T  and 
a  aubapace  of  the  whole  apace  that  is  the  complete  domain  of  T.  We  shall 
see  later  how  to  uae  thia  connection. 

Before  proving  the  theorem  we  need  to  prove  the  “projection  theorem:’* 

Theorem  3:  Given  a  aubapace,  a  vector  x  not  in  5^,  and  a  (proper) 
inner  product  relation,  then  there  exists  a  unique  vector,  w,  in  S^, 
called  the  “projection  of  x  on  “  such  that 

<x-w,  y>  ■  0  (A-24) 


for  any  y  in  Sj . 

Consider  any  y  in  Then,  since  x  is  not  in  -Sp  (x  **  y)  is  not 

the  null  vector.  Hence 

<x-y,  x-y>?<0  (A-25) 

This  inner  product,  considered  as  a  function  of  y,  must  have  a 
greatest  lower  bound.  That  is,  there  must  exist  some  y^^  such  that 

<  X  -  y,,  X  -  yp  >  •  o  (A-26) 

where  a  is  a  positive  real  number,  and  such  that  for  any  y 

<x  -  y,  X  -  y  >  >  <x  -  y,,  X  -  y,  >  ■  a  (A-27) 

We  do  not,  at  this  point,  assert  that  y^  is  unique,  but  only  that  at 
least  one  such  y,  exists. 
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We  shall  show,  now,  that  is  the  projection. 

Any  vector  y  in  5^  can  be  expressed  as  a  linear  combination  of 
and  some  other  vector  s: 

y  -  yj  +  0*  (A-28) 


where  a  is  a  scalar.  Then 


<»-y,  x“y>  •  <x-y,  -otz.  x“yj-a*> 

-  <x-yj,  x-yj  >-«*<«,  x-yj>-a<x-y,.  *> 

+  *  >  >  <x  “  yo.  *  ”  yo  ^  (A-29) 

This  must  be  true  for  any  non-null  Z  and  any  d.  In  particular,  it 
must  be  true  for 

<*,  X  “  yo>  <x  -  yj,  *>* 
a  .  -  -  -  (A.30) 

<  X.  X  >  <  X,  X  > 


Substituting  this  in,  we  find  that 


Tq.  » 

<  X,  X  > 


>1* 


>  0 


(A-31) 


Tht  left  side  is  negative  semi-definite.  It  cannot  be  greater  than 
zero,  so  it  must  equal  zero.  Since  X  is  non-null  and  the  inner  product 
relation  is  proper— i.e.,  positive  definite — we  must  have 

<x  -  y,,  x>  -  0  (A- 32) 

Since  this  is  true  for  any  x  in  the  yj^  determined  as  a  vector 
giving  the  greatest  lower  bound  in  Eq.  (A-26),  is  a  projection  as  defined 
by  the  theorem. 

Suppose,  now,  y^  were  not  uniquely  defined.  Suppose  there  existed  a 
yj  also  giving  the  greatest  lower  bound  in  Eq.  (A-26).  Then  it,  too, 
would  obey  Eq.  (A*32): 
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(A- 33) 


for  any  c  in  5|. 

Subtracting  Eq.  (A*32)  froai  Eq.  (A>33),  we  obtain 

<  y#  •  Yi  »  >  ■  0  (A-34) 

But  Yg  and  y^  are  both  in  S^.  Therefore,  ao  ia  their  difference. 

Since  X  ia  any  vector  in  •$!,  we  can  take  it,  in  particular,  aa  y^  **  y^: 

“  >1*  y  “  Xi  ^  ■  0  (A-35) 

Hence  y^  **  yj  ia  the  null  vector,  and  yj  ia  identical  to  y^. 

Therefore,  a  projection  exiata  and  ia  unique. 

It  may  be  noted  that  the  theorem  concerna  a  vector  not  in  Sj. 

Thia  reacriction  can  be  immediately  removed,  however.  If  X  ia  in  Sj, 
then  we  define  it  as  ita  own  projection,  and  the  reault  becomes  trivial. 

We  use  thia  theorem,  now,  to  prove  the  following: 

Theorem  4:  Given  a  (proper)  inner  product  relation,  the  equation 

u  ■  Tr  (A«36) 

has  a  solution  for  a  given  u  if  and  only  if  ii  is  orthogonal  to  every 
w  that  is  a  solution  of 

T'w  -  0  ,  (A-37) 

To  prove  the  necessity  of  the  condition,  consider  any  vector  X  and 
any  w  that  is  a  solution  of  T*w  >  0: 

0  -  <T*w,  x>  •  <w,  Tx>  (A-38) 

Thia  is  true  for  any  x,  and,  in  particular,  for  X  ■  t: 

<w,  Tr>  •  <w,  M>  •  0  (A-39) 
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Thsrefore,  ■  i>  orthogonal  to  any  w  that  ia  a  aolution  of  the  hoaogeneoua 
adjoint  equation. 

To  prore  the  aufficiency  of  the  condition,  auppoae  that  ■  ia  orthogonal 
to  all  aolutiona,  «,  of  the  hoaogeneoua  adjoint  equation.  We  aak,  then,  if  it 
then  followa  that  n  ia  in  the  range  of  T. 

Now  the  range  of  T  ia  a  aubspace  aince,  if 

“i  ■  Ttj 

u,  «  Ttj  (A-40) 


then,  aince  T  ia  linear, 

flMj  +  Aij  -  T(avj  +  (A-41) 

That  ia,  if  Uj  and  Mj  are  in  the  range  of  T,  then  ao  ia  any  linear  combi¬ 
nation  of  Hj  and  Bj.  Hence,  the  range  ia  a  aubspace. 

Therefore,  the  projection  theorem  applies.  That  is,  if  X  is  any 
vector,  then  there  exists  a  in  the  range  of  T  such  that 

<a  -  a,,  Tx>  -  0 

-  <T*(a  -  a,),  x>  (A-42) 

This  is  true  for  all  x,  and  in  particular  for  x  >  T*(a  *'  w,).  Hence, 
this  vector  must  be  the  null  vector. 

T*(a  -  a,)  -  0  (A-43) 

Hence  (a  ~  Ug)  is  a  solution  of  the  homogeneous  adjoint  equation. 

By  assumption,  then,  (a  ~  Bg)  ia  orthogonal  to  u,  so  that 

<u-ag,  B>  ■  0 

■  <a“no,  u-Ug>  +  <B“Bg,  ao>  (A-44) 
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Hotrever,  itg  is  in  the  rsnge  of  T,  being  the  projection  of  ■  on  the 
rsnge.  Also,  (n  ~  Ug)  is.  by  the  definition  of  s  projection,  orthogonal 
to  all  vectors  in  the  range  of  T.  Hence,  the  second  term  above  vanishes, 
leaving  only 

< ■  “  ttj,  a  -  >  ■  0  (A-45) 

so  that 

a  •  Uj  (A-46) 

Therefore  u  is,  in  fact,  in  the  range  of  T. 

We  have  proven,  then,  both  the  sufficiency  and  the  necessity  of  the 
condition. 

The  theorem  that  we  originally  wanted  to  prove,  Theorem  2,  is  now 
aimply  a  reatotement  of  Theorem  4,  for  the  space  spanned  by  w  is  the  null 
apace  of  T*  and  the  condition  that  u  is  orthogonal  to  all  W  defines  the 
orthogonal  complement.  Hence,  the  range  of  T  is  the  orthogonal  complement 
of  the  null  space  of  T*. 

We  emphasize  that  our  entire  development  does  depend  on  the  positive 
definiteness  of  the  inner  product— t.  e. ,  that  it  be  “proper.”  With  this 
limitation,  however,  it  does  not  depend  on  the  particular  inner  product 
being  used.  As  the  inner  product  is  changed,  the  adjoint  operator  changes 
but  so  does  the  definition  of  orthogonality,  and  hence  of  the  orthogonal 
complement.  The  two  changes  complement  each  other  in  such  a  way  that  the 
range  of  T  as  the  orthogonal  complement  of  the  null  space  of  the  adjoint 
of  T  is  unaffected. 

This,  tlien,  gives  us  the  means  of  solving  Eq.  (A-1)  in  the  general 

case. 


We  have  that  u,  which  is  known  to  be  in  the  range  of  T,  ia  in  the 
orthogonal  complement  of  the  null  space  of  T*.  Hence,  we  can  premultiply 
by  Eq.  (A-1)  by  T*  and  find 

•  ■  T*u  ■  T*Tt  (A-47) 

and  know  that  w  ia  not  the  null-vector  if  u  ia  not. 
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Now  the  operator  T*T  ia  eelf-edjoint  under  an  inner  product  relation 
that  ia  aaauaed  to  be  proper: 

(T*T)*  -  T'T  (A- 48) 

Therefore,  it  ia  of  aimple  structure,  and  the  results  of  the  preceding 
section  apply.  That  is,  a  particular  solution  to  Eq.  (A>47)  is  obtained 
by  conaidering  4(T*T),  the  rainiouni  polynomial  of  T*T,  and  forming  i/'(T*T) 

!  from  it.  Thia  «e  can  do  aince,  firatly,  T*T  ia  singular  if  T  is,  so  that 

<^(T*T)  haa  no  constant  term,  and  secondly,  T^T  being  of  simple  structure, 
the  coefficient  of  the  term  in  the  first  power  of  T*T  is  not  zero.  Then 
we  have  that 

vi  -  i/>(T»T) 

-  (A.49) 

and  the  general  solution  ia 

V  -  0(T*T)T*u  +  Vj  (A-SO) 

where  is  any  vector  such  that 

f 

I  Tvj  •  0  .  (A-Sl) 

I  This,  then,  gives  us  a  general  solution  to  our  problem. 

I  4.  COMPARISON  OF  SOLUTIONS 

The  particular  solution  obtained  in  Eq.  (A-49)  is  more  involved  than 
that  of  Eq.  (A-18).  Equation  (A-49)  is,  of  course,  more  general,  in  that 
it  admits  a  T  of  any  structure.  But  suppose  T  is  of  simple  structure. 

We  may  ask  if  there  are,  in  thia  case,  problems  in  which  it  might  still 
be  worthwhile  to  use  the  formally  more  complicated  solution  of  Eq.  (A-49). 

The  two  particular  solutions  are  not,  in  general,  the  same.  By 
Eq.  (A-18),  we  find  the  particular  solution  that  is  in  the  range  of  T. 

In  Eq.  (A-49)  we  find  the  particular  solution  that  ia  in  the  range  of  T*, 
and  which  therefore  ia  in  the  orthogonal  complement  of  the  null  apace  of 
T.  These  two  solutions  will  be  the  same  only  if  the  operator  ia  normal 

iZ 


( 


with  r«apect  to  the  inner  product  relation  being  need.  If  the  operator 
ia  not  normal,  the  two  aolutiona  will  differ  by  a  rector  that  ia  in  the 
null  apace  of  T. 

The  formally  more  complex  aolution  of  Eq.  (A-49)  dependa  upon  the 
particular  choice  of  the  inner  product  relation.  We  can  auggeat  then 
that  it  may  be  uaeful,  even  when  Eq.  (A-18)  ia  uaable,  preciaely  becauae 
it  permita  ua  to  chooae  the  inner  product  relation  that  ia  moat  appro¬ 
priate  for  the  particular  problem.  It  haa,  in  other  worda,  a  built-in 
flexibility  that  may  be  of  value. 


63 


APPENDIX  n 

PROPEirriESOFIIlE  COMMUTATOR  OPERATOR 
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APPENDIX  B 


nUFfmESOFTK  commutatuk  operator 


In  this  appendix  we  shall 
commutator  operator.  That  is. 


discuss  some  of  the  properties  of  the 
we  shall  consider  the  equation 


u  -  T,W  -  -r,v  -  [V,f] 

-  (VW  -  WV) 


(B-l) 


so  that 

T.  -  [V.  ] 

T,  -  [f.  ]  (B-2) 


(Note  that,  for  convenience,  we  are  here  dropping  the  scalar  factor 
j  that  we  usually  include.  It  may  be  considered  as  being  included  in 
one  of  the  matrices.  } 

Such  equations  arise  in  a  number  of  different  contexts  in  coupled 
mode  theory,  mechanics,  quantum  mechanics,  and  other  fields. 

In  coupled  mode  theory,  it  arises  naturally  in  the  study  of  the 
noise  power  representations.  If,  for  example,  we  have  the  equation 


dx 

di 


-jRx 


(B-3) 


where  R  is  known  to  be  K-hermitian  for  some  constant,  non-singular, 
hermitian  K,  and  X  is  a  vector  represented  as  a  column  matrix,  we  can, 
if  we  like,  study  the  behavior  of  the  dyad 

W  »  x»*K  (B-4) 
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Than  we  aee  that 


di 


—  *♦»  +  * 

dt 


(r)' 


-  -;RufK  +  j*x*R*K 


■  -yftut^K  +  yxx^KR 


-  -jBi  +  jWR 

-  -y[R.W]  -  -;T,W  (B-S) 


For  the  atudy  of  noiae  behavior  in  a  ayatem,  W  is  a  more  convenient 
variable  aince  ita  terma  are  apectral  power  denaities,  and  thus  are 
meaningful  even  when  the  components  of  x  are  “pathological”  functions  of 
time. 

In  addition,  we  have  also  observed  that  the  general  case,  where  R 
in  Eq.  (M-3)  may  be  any  matrix  function  of  2,  is  extremely  complicated. 
However,  the  class  where  R(z)  is  not  unlimited  but  constrained  to  be  such 
that  ita  derivative  is  expressible  as  a  commutator  with  some  matrix  U, 


dR 
“  • 

dz 


jtR,U] 


(8-6) 


is  an  important  class  of  systems,  with  some  useful  simplicities.  It  in¬ 
cludes  such  non-uniform  systems  as  the  exponentially  tapered  transmission 
line,  the  distributed  parametric  amplifier,  and  the  like. 

A  third  aspect  of  the  use  of  commutator  operators  is  in  the  expansion 
of  certain  forms.  In  particular,  we  shall  see  that  the  form 

is  conveniently  expandable  in  terms  of  commutators. 

Our  purpose  here,  then,  is  to  establish  the  fundamental  properties 
of  these  operators  so  as  to  provide  the  basis  for  their  employment  in 
various  problems  of  analysis. 
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1.  THE  LINEAR  VECTOR  SPACE 

I  We  consider  Eq. 

I  U  -  T^W  -  [V.f]  (B-7) 

I 

i  where  we  regard  aa  the  operator  in  queation. 

1  The  operator  operatea  on  any  n  ^  n  matrix,  W,  to  generate  an 

r  n  ^  n  matrix  U. 

I 

I 

We  aaaert  that  the  apace  of  all  n  ^  n  matricea  ia  a  linear  vector 
apace  in  which  ia  a  linear  homogeneoua  operator. 

It  may  appear  strange  to  call  a  square  matrix  a  vector.  However 
the  properties  that  the  elements  of  a  set  must  have  for  the  set  to  be  a 
linear  vector  space  do  not  depend  on  the  form  of  the  representation.  We 
have  merely  adopted  as  a  convention  that  is  usually  convenient  the  form 
of  a  column  matrix.  This  is  usually  convenient,  but  not  always,  and  is 
not  necessary. 

For  a  set  to  be  a  linear  vector  space,  we  require  (1)  that  the 
operations  of  addition  of  two  members  and  of  multiplication  by  a  scalar 
be  defined,  with  the  usual  properties  of  addition  and  scalar  multiplica¬ 
tion,  (2)  that  the  sum  of  any  two  members  of  the  set  be  in  the  set,  and 
(3)  that  the  product  of  any  scalar  times  a  member  of  the  set  is  in  the  set. 

j  Tlie  set  of  all  n  x  n  matrices  with  elements  in  a  given  field  have 

these  properties  under  the  usual  matrix  addition,  and  scalar  multiplication, 

I  and  therefore  form  a  linear  vector  space. 

We  could,  if  we  liked,  represent  such  a  matrix  in  the  standard  form. 

;  We  could,  for  example,  form  a  column  vector  of  dimensionality  n*  in  which 

the  components  of  the  matrix  were  listed  in  some  prescribed  order.  This 
is  rarely  worth  doing,  but  does  illustrate  the  possibility. 

Of  greater  interest  is  the  possibility  of  expanding  an  arbitrary 
member  of  the  set  in  terms  of  some  convenient  complete  set  of  standard 
matrices.  For  example,  if  n  »  2,  so  that  we  are  considering  the  space  of 
2x2  matrices,  we  can  use  the  identity  and  the  Pauli  spin  matrices  used 
in  Sec.  II-B-4: 
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I 


(B-8) 


Then  any  matrix,  A,  can  be  expreased  as 

A  ■  *l®l  ^  ^  (B-9) 

and  «e  can  represent  A  as  coKoij,  ttj,  0,^,  OL^), 

More  generally,  it  is  aometimea  convenient  to  use  the  set  of  ff-dyada 
formed  from  some  complete  set  of  K'orthogonal  maximally  normalized  vectors. 

These  various  representations  are  possible  and  are  sometimes  con¬ 
venient.  However,  it  is  not  necessary  to  change  to  such  a  representation. 
For  the  immediate  purpose,  they  merely  serve  to  illustrate  the  fact  that 
the  set  of  all  n  ^  n  matrices  is  a  linear  vector  apace — a  fact  which  then 
is  independent  of  the  particular  representation  being  used.  We  shall,  in 
this  section,  use  the  representation  as  square  matrices. 

2.  LINEARITY  AND  HOMOGENEITY  OF  THE  COMMUTATOR 

We  have  asserted  that  the  operator,  T^,  is  linear  and  homogeneous. 

It  is  linear  because 

T,(aX  +  .^)  -  V(aX  +  /»¥)  -  (aX  +  a’YIV 

-  a(VX  -  XV)  +  /»(¥¥  -  YV) 

-  aT,X  +  /’T^Y  (B-10) 

It  is  homogeneous  since,  if  X  is  the  null  matrix,  then  T^X  is  the 
null  matrix. 

It  may  be  noted  that  this  implies  that,  if  X  be  represented  in  some 
manner  as  a  column  vector,  as  discussed  in  the  last  section,  then  the 
appropriate  representation  of  T,  is  a  matrix. 

If,  for  example,  we  are  considering  2^2  matrices  and  use  the  representa¬ 
tion  of  Eq.  (R-9)  in  terms  of  the  Pauli  matrices,  thenitisessy  to  see  that 
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If 


Ic  o)  I 


(B-11) 


(B-12) 


then  we  can  find  that,  in  this  representation 


T.X 


[V.« 


0  j(a  -  S)  -  y)  0 
-j(a  «  S)  0  X/’  +  r)  0 

(/?  -  y)  (0  +  y)  0  0 

0  0  0  0 


(B.13) 


Af^ain,  we  cite  this  only  ss  an  example. 

We  may  note  that  the  matrix  of  Eq.  (B-13)  is  of  rank  2.  It  has  the 
zero  eigenvectors 


X 


1 


(B-U) 


This  we  would  expect  since,  in  the  original  representation  of  as  the 
commutator,  X  ■  I  and  X  ■  V  both  commute  with  I'',  so  that  they  are  annihi¬ 
lated  by  the  commutation  operator. 

We  observe,  in  fact,  that  the  commutator  operator  is  necessarily 
singular.  The  subspace  spanned  by  I,  V,  V^,  ...  is  always  annihilated  by 
T^.  More  generally  the  null  apace  of  is  the  space  of  matrices  that 
commute  with  V,  and  this  is  always  a  proper  subspace. 

As  a  final  remark,  we  will  add  that  the  cross  product  of  the  vectors 
in  vector  analysis  can  be  set  up  as  a  commutator  relation  That  is,  if 
we  have 


U  ■  X  w 


(B-15) 
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it  ia  possible  to  choose  a  representation  of  the  vectors  as  matrices  so 
that  Eq.  (B>1S)  takes  the  form  of  Eq.  (B>7).  One  way,  for  example,  is 
by  using  the  isomorphism  of  Eq.  (B-11)  in  reverse,  after  reducing  the 
dimensionality  to  3  by  specifying,  for  example,  that  ■  0,  or  4  ■  -0. 
Thus,  Eq.  (B-7)  can  be  considered  as  a  generalisation  of  Eq.  (B*1S).  The 
singularity  of  ia,  then,  a  reflection  of  the  fact  that  the  cross 
product  of  Eq.  (B-15)  ia  singular,  and  vanishes  if  v  <jc  S. 

3.  INNER  PRODUCT  RELATION 

We  now  wish  to  obtain  an  inner  product  relation  what  will  metricize 
our  space  of  n  x  n  matrices.  For  this  purpose,  we  need  a  procedure 
whereby  we  can  obtain  from  two  elements,  U  and  V  of  the  space,  a  scalar 
valued  function  that  is  linear  in  U  and  V.  Specifically,  we  need  a 
scalar  valued  function  which  we  will  write  as<U,V>,  such  that 


<U.aV>  «  a<U,V> 

(B-16) 

<V  l]>  -  <u  v>* 

(R-17) 

<11,  (Vj  +  Vj)  >  .  <U,Vi  >+  <U,Vj  >  . 

(B-18) 

We  also  want  this  relation  to  be  positive  definite  so  that  the 
inner  product  relation  is  ‘‘^d  we  can  apply  the  results  of 

Appendix  A.  That  is,  we  want 

<U.U>  >  0  (B-10) 

with  the  equality  holding  if  and  only  if  U  is  the  null  matrix. 

Our  first  problem  is  how  to  obtain  a  scalar  valued  function  from 
matrices.  The  scalars  that  are  naturally  associated  with  a  matrix  are 
the  sums  of  the  h*rowed  principal  minors,  which  are  the  coefficients 
of  the  characteristic  equation  and  which  also  are  the  symmetrical  ele¬ 
mentary  functions  of  the  eigenvalues. 

Of  these  scalars,  only  the  first,  which  is  the  trace,  or  the  sum 
of  the  diagonal  terms,  is  linear  in  the  sense  that  the  trace  of  a  scalar, 
a,  times  a  matrix,  A,  is  d  times  the  trace  of  A. 
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So  it  is  reasonable  to  expect  that  *e  will  vent  to  use  the  trace  of 
some  coabination  of  U  and  V. 

One  coabination  that  will  work  is  the  product  of  and  U 
<U,V>  •  trlJtV. 

This  evidently  obeys  Eqs.  (B-16),  (B-17),  and  (B-18).  That  it  also  is 
positive  definite  can  be  seen  by  noting  that,  if  is  the  ij  coaponent 
of  U,  and  Vjy  of  V,  then 

<U,V>  .  tr  2 

•  tu'v..  (B-20) 

ij  '  *  ' ' 

and 

<U  U>  -  I  |f/^.  J*  >  0  (B-21) 

Other  inner  product  relations,  also  based  on  the  trace  of  the 
product  of  and  V,  but  with  appropriate  other  factors,  can  also  be 
found.  However,  it  is  not  apparent  that  these  are  useful,  and  we  shall 
not  go  into  them  here. 

We  will  note  for  future  reference  the  important  property  of  the 
trace  that 

tr  AB  - 

This  follows  since 

tr  AB  • 


It  is  this  property  that  is  involved  in  Eq.  (B-17).  However,  it 
should  also  be  noted  that  it  is  not  true  that  the  trace  of  the  product  of 
any  number  of  matrices  is  independent  of  the  order.  We  do  have,  by  re¬ 
peated  application  of  this  theorem,  that 


tr  BA 


(B.22) 


tr  ^  ^ij^jk 


2 


ij 


Hi* 


£  BA  .  •  tr  BA 

ij 


(B-23) 
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tr  ABC 


tr  CAB  •  tr  BCA 


but  thia  ia  not,  in  general,  equal  to  tr  CBA,  for  example.  The  trace  of 
the  product  of  a  number  of  factora  ia  independent  of  a  cyclic  permutation 
of  the  factora,  but  not,  in  general,  of  any  permutation. 

4.  THE  ADJOINT  OPERATOR 

Our  principal  intereat  in  obtaining  an  inner  product  relation  ia  to 
give  ua  the  adjoint  operator. 

We  will  ahow  that,  under  the  inner  product  relation  of  Sec.  3  above, 

T*  .  T  ,  -  [V*.]  (B-24) 

ia  the  adjoint  operator,  T*,  of 


T,  -  [V.  ]  (B-2S) 

Thia  followa  aince 

<U.T,W>  -  tr  {Ut(Vf  -  WV)} 

-  tr  U*VW  -  tr  U^WV 

-  tr  U»VW  -  tr  VU»W  (B-26) 

The  aecond  term  in  the  laat  line  ia  a  cyclic  permutation  of  the  term 
above  it,  and  ao  doea  not  change  the  trace.  It  followa  then,  that 

<U,T,W>  -  tr  {(UtV  -  VU*)W} 

-  tr  {(V*U  -  UV*)tW} 

•  <T  ,U,W>  (B-27) 

Thia  givea  ua,  then,  an  adjoint  operator  under  a  proper  inner 
product. 
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With  this  adjoint  operator  the  results  of  Appendix  A,  are  iMsdiately 
applicable.  For  exaaple,  according  to  Theoren  4,  Appendix  A,  we  know  that 
Eq.  (B-1)  is  soluble  for  W  if  and  only  if  U  is  orthogonal  to  every  matrix 
that  commutes  with  V^, 

In  particular,  if  we  consider  the  equation 

<A 

—  -  j[R.A]  -  (B-28) 

we  find  that  it  is  soluble  for  A  if  and  only  if  R'  ■  dR/dx  is  orthogonal 
to  each  of  the  sequence  R^**  ■  I,  R^,  R^*,  ...  Hence 

<I,R'>  •  tr  R'  -  e 

<R»,R'>  -  tr  (HR')  -  0 

<R»*.R'>  -  tr  (R*R')  -  0 

etc.  (B-29) 

Since,  for  example,  from  Eq.  (n-23) 

tr  (RR')  -  tr  (R'R)  -  —  tr  (HR'  +  R'R) 

2 

■  ■—  {tr  (R*)'}  (B-30) 

it  follows  that  the  derivatives  of  all  the  powers  of  R  must  have  sero 
trace.  It  follows  directly  that  the  eigenvalues  of  R  must  be  constant. 

Given,  then,  that  R  has  constant  eigenvalues  so  that  Eq.  (B*28)  is 
soluble,  we  can,  then,  solve  Eq.  (B>28)  for  A  by  the  methods  of  the  last 
section.  If  is  an  operator  of  simple  structure,  or  at  least  has  no- 
chain  of  generalised  eigenvectors  of  sero  eigenvalue  then  there  exists  a 
polynomial  function  of  1],  ,  i/'(Tj| )  such  that 

A  -  ./r(T,  )R'  +  A, 

where  A,  is  a  matrix  in  the  null  space  of  . 
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It  ahould  be  noted  that  for  example,  ia  the  reault  of  applying 
Tji  twice.  Thua  it  ia 

T,*A  •  [R.T,A] 

•  [R(R.A)]  -  [jR,A]  (B-31) 

where  the  aymbol  meana  the  aecond  commutator. 

The  power  of  any  operator  T  doea  not  mean  the  power  of  a  matrix, 
unleaa  the  operator  ia  matrix  multiplication  by  a  matrix.  It  meana, 
inatead,  the  repeated  application  of  the  operator. 

We  ahall  not  puraue  the  development  of  Eq.  (B-28)  further  at  the 
moment.  We  ahall  only  note  that,  for  R  to  be  auch  aa  to  permit  ita 
variation  to  be  deacribed  by  Eq.  (B-28),  it  ia  neceaaary  and  aufficient 
for  the  trace  of  R^(ii  ■  1,  2,  .  .  . )  to  be  conatant. 

5.  EIGENVECTORS  AND  EIGENVALUES,  V  OF  SIMPLE  STRUCTURE 

We  have  found  that  the  operator  ia  linear  and  homogeneoua. 
Therefore,  it  haa  at  leaat  one  eigenvector.  That  ia,  there  ia  at  leaat 
one  and  one  auch  that 


T  U 


a 


(P-32) 


(The  reaaon  for  uaing  the  double  index  will  become  apparent  ahortly.  ) 

The  fact  that  the  are  matricea  doea  not  affect  their  being 

eigenvectora  in  the  apace  of  n  x  n  matrices. 

If  we  use  the  dyad  expansion  on  the  eigenvectors  of  V,  V  being 
assumed  to  be  of  simple  structure  then 


V  *  Z  a  A  .K 

j  I  <  < 


if  K  is  a  metric  for  which  V  is  at  least  A-normal.  The  set  are 

Af-orthogonal  and  maximally  normalized. 
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Thao 


(B-33) 


is  an  eigenvector  of  with  eigenvalue  **  for 


-  -  \)E,, 


(B-34) 


Further,  aince  the  aet  (E^y)  ia  complete,  theae  are  the  entire  set 
of  eigenvectors  of  T^,  and  is  of  simple  structure  if  V  is. 

The  null  space  of  is  the  space  apanned  by  the  set  plus 

any  additional  terms  due  to  degeneracies. 

The  minimum  polynomial  of  ia  the  product 


n(T,  -  M,yl) 


(B-35) 


over  all  terms  with  distinct  M<y  However,  for  each  we  evidently  have 


■  ~u.  . 


(B-36) 


The  minimum  polynomial  contains  the  single  factor  T^,  corresponding  to 
the  xero  eigenvalue.  The  other  factors  then  occur  in  pairs: 


(T, 


(B.37) 


Hence  the  minimum  polynomial  contains  only  odd  powers  of  T^,  and  can  be 
written  as 

<^(T,)  -  {o,  +  (o,  +  OjT*  +  ...)T»}T 


«,tl  -  e(T)T*]T 


(B-38) 


77 


V 


where 

^(T)  -  “—(«,  +  o.T>  +  ...)  (B.39) 

«i  *  * 

Ae  discueaed  in  Sec.  doea  not  vaniah  providing  ia  of 

ainple  atructure,  aa  we  have  ahown  it  ia  if  V  ia  of  aimple  atructure. 
Hence,  d(T)  exiata  and  Eq.  (B>38)  ia  valid.  In  other  worda,  if  V  ia  of 
aimple  atructure,  we  can  alwaya  find  a  polynomial  function  of  T^,  d(T^), 
auch  that,  for  any  U 


T^U  -  ^{T,)T»U  . 

We  ahall  not  diacuaa  the  more  general  caae  where  V  may  not  be  of 
aimple  atructure,  Esaentially  the  came  reaulta  apply  except  that  if  V 
ia  not  of  aimple  atructure,  then  ia  not  either,  and,  in  fact,  muat 
then  have  a  chain  of  generalized  eigenvectora  with  zero  eigenvaluea. 
Therefore,  we  muat  always  uae  the  more  general  procedure  for  ita 
'* inveraion "  aa  diacuaaed  in  Appendix  A. 
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